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In this Issue 

One of the most competitive areas in the personal computer market tod ay is the 
race to provide printing solutions to meet the needs of the entire family. The 
specifications for a successful printer in this arena inctude technologies that 

provide continuous improvement in throughput and print quality, low cost, 
attractive small size, quiet operation, ease of use. and designs that lend them- 
selves to high-volume production. 

Design teams at Hewlett-Packard divisions that are responsibte for HP color Inkjet 

printers decided to take a phased approach to meet the challenges posed by 
these speciticatians. The HP DeskJet 820C (page 6) is the first product resulting 
from this evolutionary product plan. The DeskJet 820C contains a writing system, print mechanism, and 
package leveraged from the HP DeskJet S50C and a new electronic, firmware, and software architecture 
celled the Printing Performance Architecture (PPA). 

PPA grew out of the recognition that newer generations of personal computers have the bandwidth to 
take on some of the computing tasks typically relegated to the printer, and many software applications 
are rapidly moving away from MS-DOS "to a Microsoft Windows"' environment. With this realization, 
the design teams developed a software, firmware, and digital electronics architecture that uses the 
computing resources of the PC instead of duplicating these resources in the printer. This architecture 
helped to lower the cost of the printer by reducing RAM from IM bytes to 12aK bytes, ROM from 2M 
bytes to 64K bytes, and the gate count of the largest ASIC by 25%. 

Wfththe reduction in the logic-supporting hardware in the DeskJet 82CIC, printer functions such as swath 
cutting and data formatting were moved into the software driver. The article on page 12 discusses the 
design of the PPA printer software driver, which implements functions traditionally found in the printer, 
handles PPA communication between the host and the printer, and provides PCL emulation for DOS 
application support. 

Because so many printer functions are implemented in the host software driver, fewer functions are 
needed in the firmware for the DeskJet 820C. As described in the article on page 22, "Don't touch the 
dots" was the firmware designers' golden rule. This means that hrmware in the printer was designed so 
that it is only responsible for taking the formatted data from the host and sending commands to the motor 
and print cartridge to place the dots at the appropriate places on the paper. The printer firmware is also 
responsible for user interface and status functions. 

ASiC development for the PPA printer controller and the inkjet printhead drive electronics is described 
in the articles on pages 31 and 38 respectivety A typical digital controller for a printer contains a micro- 
processor to control the printer RAM for incoming data, ROM for firmware, and custom logic for printer- 
specific functions. For the DeskJet B20C, these functions were integrated on one chip and optimized to 
meet the requirements of PPA, The pen drive electronics are responsible for driving signals to eject the 
ink from the pen and providing a control system to maintain a constant temperature in the active area of 
the pen. For the DeskJet 820C's pen drive electronics, the functions of four ICs were integrated in one 
chip, and ail the electronics related tu the pens were moved onto the carriage's printed circuit assembly, 

Today, key design decisions associated with developing a microprocessor not only focus on technical 
requirements such as a higher speed, taut also on business and marketing requirements. A few years 
ago HP began developing a line of PA-RISC processors to meet the needs of higher-volume and more 
cost-sensitive products. The article on page 43 introduces four articles that describe the latest proces- 
sor in this line, the HP PA 7300LC. The HP PA 7300LC processor is optimized for entry-level to midrange 
high-volume systems such as workstations and servers. 

The PA7300LC processor is the result of leveraging the superscalar CPU core from the HP PA 71D0LC 
processor, adding a large embedded primary cache, and reducing the chip area and pipeline stalEs. The 
article on page 48 describes the PA 7300LC microarchitecture, the CPU core, and the memory and I/O 
controller. The leveraging effort, the chip area reduction, and the redundant cache RAM arrays are 
discussed in the article on page 61. 
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f^o matiar ficw mucli levaragmg ^s dnne or tiow mature the IC fabrication process, funationaf verificdtion 
of a new chip is always an important step in the process. The article on page 6S describes the processes 
used in the presilicon and postsilicon phases lo verify the correctness of the PA 7300LC processor. 

The HP 9000 D-class server (page 73) and the HP 9000 B-class workstation Ipage 82) are examples of 
products that use the PA 73O0LC processor. The D-class server is targeted for the high-volume environ- 
ment of departmenial and branch comptjting. The article includes a comparison between different models 
of D'Class servefs that use HP processors other than the PA 730DLC, The HP 9000 B-class workstation \s 
comparably priced to the HP SOOO Model 715 workstation but has superior performance and I/O capabili- 
ties. The article focuses on how cooperative engineering between the various entities involved in product 
development helped to reduce the time to market for this product. 

Software testing is always one of the most critical phases of the software development process. If test 
plannmg fs late or madequate. the test effort can cause late, or worse, low-quality products. The level of 
testing and the pass/fail criteria vary with the type of software. For example, software used in video games 
would not be tested the same way or have the same pass/fail criteria as software used m patient monitors. 
The articles beginning on page 89 describe the processes, languages, and tools the authors have devel- 
oped for testing safety-critical software. In this case the safety-critical software involves software used 
in the HP OmniCare pahent monitors, which monitor the physiological parameters of critically ill patients. 

The evolution of the software testing process for the HP OmniCare patient monitors and resulting test 
tooling called testWBre are described in the first article. The next article (page 95) describes a high-level 
programming language called ATP [Automatic Test Processor), which allows the integration of existing 
test processors used for validation. The AutoCheck program (page 103) evaluates test files and documents 
the results of the evaluations. The final article (page 1091 describes how these test tools can be used to 
help in testing localized software 



C.L Leath 
Managing Editor 



Cover 

The cover shows an artistic rendition of the change in the printing model brought on by the Printing 
Performance Architecture (PPA} implemented in the HP DeskJet 820C. The top figure depicts prmtmg 
before the PPA where most of the printing logic resides in the printer The lower figure depicts printing 
after the PPA where most of the printing logic resides in the host computer. 



What's Ahead 

Featured in our August issue will be: 

• The design and verification of the HP PA 8000 and PA 8200 four-way superscalar CPUs 

• The HP OpenCall family of telecommunications platforms based on intellrgent network concepts 

• Software to test policing in ATM networks 

■ An object-oriented database management system for large historical data archives 

• The HP 4500 taenchtop inductively coupled plasma mass spectrometer 

■ Five papers from the 1996 HP Design Technology Conference. 



Reminiler 

Because we are getting ready for a new Journal design and focusing on other projects, we won't be 
pub[ishing an issue in October. 



.Itiih' IM5>7 lli*wloll-P«ckfirfMtTririi;il 



)Copr. 1949-1998 Hewlett-Packard Co. 



A Lower-Cost Inlget Printer Based on 
a New Printing Performance 
Architecture 



The HP DeskJet 82DC printer is the first HP inkjet printer in an 
evolutionary product plan that takes advantage of computer and operating 
system trends to make inkjet printing affordable for more users. The 
printers integrated software, firmware, and digital electronics 

architecture uses the computational resources in the PC instead of 
duplicating these resources in the printer. 

by Da\dd J. Shelley, James X Majewski, Mark R, Thackray, and John L, McWliliams 



Tlic Iwo flewlcU-Packard divisions in Vancom cr, Washia^ton 
are rcspoiisiblo ror PHtablishiiig and maintiiining HP c^olor 
inkjtH IJii raters its inarkH leadin^^ iKTsnnal phkIlh Is in Ihe 
home and olflre ivoniputiTij^ envirojinienls. Tlu\se divisicjns 
have a ten-year JusU^ry of successful products slaning wii h 
the original HP DeskJet primer in U)8H and eulnunafing 
most recently Hith \he intnjdnctlon ol'tlie new HP Desk^Jet 
S20C^ (Fig. Din the spring of 1996. 

C)nr ( onipetiUJis, ofconrseH liave also l)een introducing 
producLs, some orwiiicli incorijorate newly developed let^Ih 
nologies that strongly challenge the perlonnance, print qual- 
ity, anfl cost "effectiveness of otn own. It is clear that our 
conifielitors are here for the long term, so we must develop 
long-term strategies to compete with them. 

Aside from competition, we also have before us an excellent 
opi>oi1 unity to hroaden our printing solutions to emhrace 
tiie needs of the entire family, a step well heyond the tradi- 
tional '*take W'ork home'' professional who has been our 
mainstay home ciistomtT. These new custom el's have dis- 
tinctly diirercMU needs that will require insight fnl unrlor- 
standing as well as tunely incorporation of focused innova- 
tions in our products. 




Fig, 1. [ir^Df-sk-Irt 8201" color iiikjel printf^r. 



At the beginning of the HP DeskJet 820C project, it wiis 

clear that our ability to retain and grow our market leader- 

shii) deiieiided heavily upon our ability to deal with these 

two powerlul mai ket d>itamics. We knew^ that we hati to 

simultaneoiLsly stay ahead of the competition and satisfy 

t he rapidly increasing hreadtli of home jjrintijig needs. The 

ing! edients for long-teiin success in this endcaA or w^ere 

equally clear: 

Technologies that result in continuously improving print 

tln'oughpnt and quality 

Designs that cam adequate profit s at redncetl t ustonier 

prices 

Designs that appeal to home customers by vdrtue of small 

siy^e, ;it tractive indtistrial desigits, very quiet operation, and 

unparalleled ease of use 

Designs capable of high-vokune produt tion at nmltiple 

international factory sites 

The ability to design proiluct^ to hit naTTcnv market 

windows. 

We reahzed tliat no single product program could success- 
fully satisfy ail of these* criteria, so we needed to develop a 
jihased approaclr We decided that each ncw^protkul devel- 
opment effort should levera|*e previous capabi lilies while 
incorijorating a .small set of new and innovative capal)ilirics 
focused on oin' customer needs. These new^ capabilities 
would then be le^e^agetl t'orward into succeeding efforts. 
In this fashion we could ensure a timely series of product 
introiliutions, each building upon pre\ious successes and 
incrementally providmg new capabilities tlia! would ulti- 
mately satisfy all of our strategic initiatives. In addition to 
U^e market tinnMiness gained by a phtiseil ajjp roach, we also 
knew- that diis pkm would use scarce development resources 
m the most efficient nuumcr. 

Design Objectives 

Tlie MP 1 JeskJet S20C printer is the first product in this evo- 
lutionaiy product plan. In keeping with our overall slraiegyt 
the primar>^ <ibjecti\es of the de\ elt)i>ment program were to: 
Leverage the speeti dml print ciuality alTurded Ijy the new 
writing system developed for the IIP DeskJet 850C 
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• Leverage the printing mechanism of the HP DeskJet 85(K' 

• Ijino%^ate bj' offering this printing capaliUiiy at a gp^eatly 
n?duced price for home customers 

• Introduce in the spring of 1996. 

While reduced cost and a spring 1996 intrmltieiioii were 
clearly the primaij- objet^tives for tiie IIP DesLIei H2tK? elfon. 
we also derided to begin our journey towards consumer 
<lesigii by Trtaking mdusTrial d<*sign chatiges thai ^} i^ithin 
tlie constraints of a leveraged nuH-hanism ai>d package. 

Since we had decided to leverage the writing system, prim 
mechanism, and package, we needed to examine the elec- 
tronic, fiimware. and software dri\^er subsyslents to find cost 
rt»riuction opportunities. Bastxl on our iriirial in\'esugation 
we set a program goaj to retkice dirc^ct material c-ost by 2WL 

Design Approaches 

Our Orst design tactit^ renxjgnized two trends. Fii-st. newer 
generations of perHiJmil computers ha\'e more (han t*n(5ugh 
bandwidth lo take on some of the computing ioini liiai ha^ 
iinril now resiflefl in the printer itself. Second software ap- 
pli( ations aie rapidly moving away from MS- DOS ^ ami into 
tl\e Mlcrcjsoft Wiiidnwis " environmeoL In view oT these 
trend^s, we decided that the IIP DeskJet S2i)t' jjnnter would 
not support printing frcmi standalone DOS appUcations. TliLs 
enabled us to develop an inlegratefi snriware. firmware, and 
digital electronics arcliitecrtire liiai uses the computational 
resoiu'ces in the FV instead of dnplic aring these resources in 
the printer. Wc call the architecture Prinfing Peffonnancv 
ArckUectuj^, or PPA. Tliis archilectural c!hoice enabled us 
to at hi eve half of our 30% cost rediiction goal by reducing 
RAM froju IM l).vles lo 128K bytes, ROM from 2M byle.s U> 
64K bytes, ajul gate count iji our laigest ASIC by 25%. At tlie 
same linie, liiglier-power PC-s enatjled us to maintain and hi 
many ceases improve system throughjjut, 

A second critical design decision was lo fJisallow simulta- 
neous firing <if ttu* blat k and color [jrint catlridges dui irig a 
slnglt^ print swuIIl WhiU" tlus siraiegy achieved an additional 
2{M of our overall gfuil. tli*' t>bvious risk was a reduction in 
throughput for docnments that r-ontain juxtiiposed black 
and rotor t1owc%^en we felt that our new system architec- 
ture would mitigate this risk lunl still alli^w us lo nieei our 
performance oi>jeciives. This single itecisiou allowed us to 
simplify the dri\'e eletlrfuiics for tJie print call ridge to the 
point where Ihey <'ouht be ItK-ated on a smalls carriage- 
moimted prinled circuit a^isembly rathtT than oji itie main 
logic printed circuit assemlily. This cliange, in Mini, euahled 
tw^o otlier very .'^ignifu ant cost reductions. Fii'st^ the inlet face 
between the logic and carriage printed circuit assenibiie.s 
was dniniatit ally simplified, allotting the ust^ of statu J^mi 
and easily availatik* cables anri comtecfor^; rather than I he 
custran ilesigns that we had previously used. Second, tising 
this new patlitloning of mialog limctiram. the riesigtt team 
was able to implement the required capal>ility using t%vo 
custom analog ASICs in conti'asl to the four iluU I tad been 
u.sed in the DeskJet 850C. 

An add] IK mat ItyMMir out^ cost goal was achieved by capitaliz- 
ing on tliree cost saving op]>ortunities in our power suj^ply. 
First, tile inirial UP De^kiJet 850 power .supply was specilled 
with signifieant margin to allow flexibility fr^r the newly ile- 
veloped writing sysleni iu Ihal producl. Ihiwev(*r, tlu^ HI* 
De.st(.Jet rS2lK develcjpnnin lerun liad Ihe advajilagtMif a 



stable writing system and therefore could specify power 
needs ntore prec^Lsely. Second, we mofiified I he user inler- 
action nioriel with ihe printers iK>wer functions and w^ere 
able to eliminate some of the complex capalrilities that were 
inchided in the IIP DeskJet S5tl Tfiird. we specified our 
power supply at a ver>^ high le%e! of abstraction to use the 
d^gn expertise of our vendor base to deUver cost^ptimal 
implementations. 

Several sources coniributed to the fifia] 20%* of our cost 
reduction goal. Our new s^tem architeciure and new parti- 
tioning of juialog functionalitj^ allowed a significant reduction 
111 the size of oiu printed circuit asseml>lies. Direct material 
cost .sa\ings were realized by elimination of tlie <'omiectors 
and suppon components for intercomieciing to Apple PCs. 
Focused design work to cost-opiimize our EM! and ESD 
solutions elimmated m;uiy discrete electronic components. 

As a result of our plan to leverage and our focus on limited 
but meaningful irmovation, tlie ilF Desk-Jet 82t)C wasintro 
duced to file tnarket on schedule in the spring of 19ftfi fol- 
lowing a de% elopment effort Utat exceeded objectives by 
achieving a 33% fibect material cost reduction and actual 
jjerformance nearly twice our Initial expectations. The 
re4-hni<jues responsible for this success have been carried 
tbrw^ard and iue already incorporated into the next prod- 
ucts in our evoluiionaiy process. 

Printing Performance Architecture 

Tire prot^ess of printing a documetit created on a comr>uter 
involves several steps to transform antl prepare the infornia- 
tion. Ill the traflitlonaJ \\indows model used by m\4^t print- 
ers, the [u'lnler driver softwaie receives a des<Ti|Jtion of die 
page IVoin i he application, traustbiins lliat tlescription into 
a i\ieciianism independent formal tliat ctui he understood by 
die printer, and encodes it into asttmdard i>rinter language. 
The encoded rU'scriptitni is then (ransferretl to the printer. 
Tlie ininler dec odes the data and fin n in is 11 for its panic iitar 
]jrinling met hanism. To encfide the inftinnation lor transrer 
lo Ihe pjinlet; Hewlett-Packard devehjped a stantlard lan- 
guage called PCJL (Printer ('ontrol Language). Because of 
Ihe widestjrcad use of MP r>riutei>i, this language has becoine 
a de facto stanciarti, P( i> allows tiie (^tnn| inter to prepare im 
image for printing witbottt detailed knowledge of the nie- 
c^banical details of the (jririter. 

For tlie Microsofi Windows cnviromiienl. IU* has always 
develo|ie<l the stjftware driven? for its inkjet printers, bi the 
Windows nioilei, the application sends a |>age description to 
the driver tin nugh the operating system. Tlte desiription is 
in the fonti of drawing objects (lines, rectangles, text, etc.). 
The dri%'er then rastc^izes tbc^ descriptifin. Rasterization is 
the process of nia[tpiuj^ tht^ P^igt^ <fescription to im X-V plane 
or bitmap. At Ibis point, the da I a still must nndergu several 
more transh>niiattoiLs t>efore it cm\ be ilsihI to print. For ex- 
ample, the first bitmap may be 24-bit tiata at 30tl dpi, where- 
as the i!il<;iet meclianism may be 6(K) rlpi and only able to put 
one of Ibm rokam at each f?ixel (black, cyart, magenta, yel- 
low). Traditionally, the r I river tii^rlormerl some otthe needeci 
inmsfonnations, but tefi many irf the more compute-in tensive 
ones to deditated hardware and nrmware in the prlnten 

After the driver has performed all t^f its computations, it 
encc^di'H tlu^ iid'onnation tising lh(^ sul>sel ofPCb needed for 
Inlmaj^peil data. The pdnler iu lurn decodes the PCLmul 
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perfoniis all of the iiecfssaiy fmliiprcrnnpiitatirHis in fonrval 
Die data for the prmtjiig mechaiiisni. Manipulatioii.s t.hf 
printer must perform include, hut are not limited to, some 
color transfomiations, cutting the data int<:» indiviciua) s^^aths, 
and separating the data into colimuis (inKjet caitiidges are 
composted of two columns of staggered nozzles). This 
process is diagrammed in Fig. 2. 

The process in the MS-DOS envirtjiimcol is siuiilai- with two 
exceptions. One, the appiication must perform die rasteriza- 
tion for all graphics rnul PCL encoding, imd two, the printer 
will accept nom*asterized text, ;:iiloviating the need for t)ie 
application to do it. Because of this second difference, pre- 
vious ink;jel i>rintt'rs weix^ rcquireci to have extensive nieniory- 
intensive fonts tuiilt into tliem. hi addition, the printer hati to 
cotuain (Irinwart^ and htirdwaie to rasterize tlie fonts. Since 
tlie data inanipuhilions performed were extensive, they 
required a powerful mieroproce^or and significant amounts 
of dedicated iiardware. 

The concept of the ne%T Printing Perfonnance Archit ect itre, 
or PPA^ is to change tJiis model hy ehniinating some of tlie 
steps. Because modem pei^onal computers hav^e j>owe!'fu] 
micTOprocessoi'S antl a large amount of system memory^ the 
iBsk of data fonuatting for the print mechanism is moved 
entirely to the host computer Also, because the data is no 
longer in a PC'I^comiialible fomiat, PCL is not iLsed to trans- 
mit the data to llie prinler histead. a ver>^ simple proprietaiy 
protocol was develtjpetL The protocol is simple cnougli that 
tlie hardware can atitomaticaJly depacketize the data witliout 
help from the finnwaie. Tlie fiata is then directly tised to print 
the image on the ])age. This process is diagrammed in Fig. ;J. 

Advantages of PPA 

Tlie prlmaiy- ad\'ant.ages of PPA are cost and performance. A 
PPA printer c^m deli% er tierformance siuular to a traditional 
non-PPA piintei- at a reduced cost. Alteniativeiy, it can deliver 
Ingher levels of performance at a similar cost. The reasons 
for the cost advantage fall uito two areas: less niemoi^- is 
requiied (lioth RAM and ROM), and a lower-perforrtrntK e 
microprocessor can he used in the prinier i>ecause the micro- 
processor doesn't have to touch the data. 

Memoiy costs are a stgi\ificant portion of the material cost 
of a loM^-ejui printej; A PPA prhifer requires significantly less 
ROM ai\d RAM. First, the PPA [mnter doesn't have to store 
any internal fonts. Traditional primers supported both the 
Windows environment and the DOS en\ iron men t. The print- 
ing model in the DOS environment requii'es the printer to 
Store font information* A DOS apphcation sends an ASCII 



code for tlie desircti text character. To t)rint that character, 
thepiiiUer nt^etls a bitmap for that cliarar ter in its ROM. 
in contrast, apijlicarions in the Windows environment send 
only bitmapped grapliic information to the printer, never 
ASCI] text. Because the PPA printer Ls tlesigned exclusively 
for I tie Wintlows environment, it doesn't need to store ilie 
fonts in ROM. 

Second, because the [printer tloesn t do any PCL decoding, 
swath cutting, or data fomiattuig, the printer requires much 
less firmware, a^ain saving ROM. Tlie [jrimaiy fiaictions of 
the prinier llmiware aje mechanism control, mpuL/ontpnt. 
and the user iJiterface. In the IIP DeskJet S2iK\ the finnwai^e 
is stored in only MK bytes of ROM. Because there is so titile, 
il was liossible to integrate the ROM into the digital ASIC. 
Pte\ious non-PPA i^rinters of similar capabiUty used 512K 
iiytes or more of ROM, 

Finally^ because the processor doesn't tosich ttie data and 
doesn't need to create any mtemiediate forms of the image 
data, the printer re^quires less RAM. Tfie HP DeskJet 820C 
uses a 128K-byte EJRAM. The t^revious generation, non-PPA 
printer used 512K to IM bytes of RAJVL Because Itiere are 
lewer niemojy IC s, the niemoiy cost for a PPA printer is 
much lower. The reduced number of memory devices also 
reduces the printed circuit board area, again saving cost. 

The second factor in saving cost conies from (he need for 
less microprocessor Iiorsepower. In a PPA printer* tlie pro- 
cessor does not do swath cutting anfi foi'jnaning of t he rlata. 
Its primary fiuictions me mechanism control, inpiityoLUput, 
and the user interface. This requires a less complex and con- 
sequently less expensive microprocessor. The HP Desklet 
820C uses a Motorola 68RC0(J(}. Tiu^ t>8E( 000 ctm be crmfig^ 
lued with either aj\ S-I)il or a 16-bi( flata bus. In the HP Desk- 
Jet S20C, the processor is used in S-bit mode. This reduces 
die bus width in the digital custom ASIC, again sa\ing area 
and hence cost- 
Finally, because of the simphfied data path in die printer (tlie 
data path is the path the data takes from tlie input/output 
poiT, through the ASIC, and out to the print cartridge), it wbs 
possible in the HI* DeskJet 820C to design a data path in 
which the processor doesn't touch the hnage data. A dedi- 
cated liaiTlwaie data path h always mtich faster, albeit less 
flexible, than a data t^ath in which the pitjcessot nmsl triins- 
form or handle the data. A full haidwar e data path is not 
limited to a PPA tucliitectiu'e, hut is nntch easier to accom- 
plish m a PPA jjrinter because of the simi>iirietl data path. 
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ChaUenge** of PPA 

\Miik» PPA has tnn\w sipiific^ant ad%^ajitages. it also brings 

with ii some challenges: 

' Df >S m supported only through WIimJow s aiid not in a 

s t iuiii alone vrwi ronni ent . 
' WA Iiosf.s must \w nirrre powerful ihim hosts for an 

iHjuivaient non-PPA printer 
' Tlif» printer driver rtHinire*; detaiUMl knowledge fif the 

] J ri n I i iig n u^'lianism . 
' PPA rfHiuired a ehange in the development and maiiufar- 

vuriiig paradigm ai the HP Wmr ouver Di\isiort 

Tlie PPA architedure di:M?s not supisorl printmg in the tradi- 
U<mal standalone OOS environment. In the Windows envi- 
rontnent, aU mforniatinn j?ent to rhe printer Ls bitmapped 
graphics. Tlie data Ls prep^u-ed under the control of a single, 
llPniiesigned and optimized printer driver. In the older DOS 
en\1ronmeni, application vendors write theii- own [printer 
drivers, A;>plicatiotis send AStll codes to tJie printer and 
ext^ect Oie printer to use Us owu internal fonts to generate 
the bimia|>i)ed chiu:iicters. The ai)t>h( ations h^ive no know]- 
eiige ^jf the printing met^hanism ;md hence are unable to do 
any .swatli cutting or data fonnaiting. 

The HP DeskJet 820C does suppoil prim ing from a DOS 
application if the application is nm under the Windows envi- 
rnmnent. Windows allows DOS-ojdy a]H>lit"ations to be mm 
ill a DOS ho.r. Printing ni this envitxjnim^n uses the standm^d 
Windows printing ntechanism mtd hence the IIP dri% er 

PPA i)rinii'rs requin^ a higher-powered tiosf Mum nun-PPA 
prime ts to arlileve comparable levels orpertormance. 
liecause the job of swatli cutting and data formatting is now^ 
(lone by die PC, more comjiuting t>^i^'^'' ^s required. Oil the 
HP DeskJet 8-2D(.V. acceptable levels of perfonnance are 
achieved with a (>((-MHz Imel4Bti-baseil tnachine with SM 
h>1esofRAM. 

PPA retjuired a si lift in die [IP Vaucoiiver Divisions develop- 
mc*ut and matui fact tiring panitligm. Havmg desigiM^d ajid bidlt 
f*(1,4)ased j)nolrrs for over 15 year's, all of the divisirjii's tools 
an! I iMOccssrs w**re ct^nterecl around dtis (ype of tJiiuler. For 
inslanee» over Ihe yeju-s the manufacl nring aiiil ciLshniur 
assunmce tirganij'rfidons had developed mm^y tools based 
£irotind PCI- ijrinters for doing t^roduciinn leasts and exercis- 
jtig the priuter in etivirmunental It^sts, N'one of these tools 
vviH^k wiib a PPA primer. Similarly, the firmware test organi- 
xatiott !ia*l to revise its tests completely. Because the HP 
Llesk-let H20V iirinter has only f>lK bytes of ROM, extensive 
demo Images mul st^lftest t>agt^s could no longer he inchided 
hi the printer. 

Because ot the higli level of integratifm and because the 
arcluteclurc follows the paradigm that "the j>rocessor 
doesn\ touch the dots/' it is tlifficult to cjhser^T the How r>f 
data througli the machine. This made del>ugging pi r jblems 
dtmng development quiie rliallenguig. This prohlent was 
solved in seveinl steji.s. Urst, I he ASIC design team tlid 
eKteusive simulatiotis. SecfHul, tlie It^am used a hardware 
emulator to emtilate the digital ASIC, This ennilator had a 
nuH*hanism that provided ports lo internal nodes so that 
ihey could lie olisened with a h»gk- analyser. FinaJly. siiutjh' 
pattcnis were di^vlsed and sent through Ihe archiU^c line thnil 
sim[i]itled prot^lems luul made debugging possible* 



Finally, in the PPA environment, the clover must liave kjiowl- 
edge of the printing hardware, TItis mirkes die driver less 
universal an<i the job of leveraging the driv^er to future prcKi- 
ucts more difficult. The driver was carefully organized and 
modularii^ed sf» the hardwari* dependent pieces can be 
clranged while the underlying driver featun^s can be lever- 
aged into funire produeL«i, 

Inside the Printer 

Inkjet print mg is a complicateil jiroeess that Involves tying 
together several elect rome<*himical sul>s>^stems tlmi work 
together to create the printed page. All mHJel printeis con- 
sist of ihese m^or suhsystenis regardless of Ihe particular 
imiJletnentation usetl for each one. Fig, 4 shows the HP 
Desk-Jet HZiK^ printer wjtli its top cover removed and the 
I n ^for s u tisy St ems 1 abeled. 

Paper Path, llie paper patli is rt^sponsible for moving paper 
tlnciugli the i>rinter. Ttie tiser inserts paper or envelopes ittlo 
the input tray. At the appropriate tittK\ a single piece of 
l)aj)er is picked from die srack and begins nioiing through 
Uie printer, BatOi thne the caniage finislies a pass over the 
paper, the paper Ls advanced an aj>prt>i)riate sunounl lo [)re- 
pare for the next pa^ of ttie carriage. At tlte end of a page, 
the paper is "kicked, "* or deposit etl in the output tray, where 
the ttser am remov^e it, tu Ihe HP Desk,let 820t\ a single 
electric motor is used to move Ihe [ taper. Patier mtivement 
is open-looiJ — tliere is no feedt>ack about the actual paper 
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position. The paper path iii tlie HP DeskJet 820C gracefully 
haiidles a variety of paper sizps aiul thiclaiesses as well as 
envelopes. 

Carriage. The caiTiage holds^ the pens used hi Lhe prii^ter. To 
priat aswalh of data, the priiilri- moves lhe carriage across 
1 he page at a constajil speed, firing lhe pens at appropriate 
times. A single motor is used to move the carriage. Carriage 
movement is a close<Mooi) process. The carriage s position 
is I racked nsuig an LEID, wliirti shines on a ]jhotorcceptor 
aJid a strip of pia^stic ina<!e np nlaHemaling hght tmd dai'k 
regions ijlaced lit^tween die LEI) and (he photoreceptor. As 
the cai'riage moves across the page, logic recognizes when 
the [jED is in front of a dark region ;nid when It is in front 
of a iiansparent region. Using this infonnatk^n, il Iracks tlie 
carriage s position on the page. In adrlirioo to holtiijig the 
|)ens, t he caniai^e holds a pruned circuit board. On the 
hoard are paits that ronner t electrically to (he pens and a 
portion of the electronics needed (o drivi*^ (he pt^ns. In fhe 
HP DeskJet 820C, all (electronics directly used to fn-e the 
pens are located on the caiTiage board (see article, page 38). 

Print Cartridges. The piint cartridges in the HP DeskJet 820C 
are iiser-replaceabit' cartridges that con(ain ho(h (lie ink and 
the mechanism fi^r placing the ink on the paper (thermal 
inKJet). They ar^e often refeiied to simply as the "i)<"f>^ " The 
pens are the same as tftose nsed in the HP DeskJet 850 antl 
870 prhiters. There are two pens: a blac^k pen and a color 
pen. The black pen has ,^00 noz/.les spaced at L/GOO inch. 
TIte swath height for black is therefore 1/2 inch. The color 
i>en holds three coloi-s of ijik: tyan. niaj^entit, and yellow. 
Each color is printed w ith a series of 64 nozzles spaced at 
1/300 iJK'h. Tlie swath height is therefore approximately 
1/5 inch. Colr^rs other than cyan, magenta^ and yellow tu'e 
created by i)iacing dots of these three colors in close prox- 
intity in appiojniate ratios. Suicc at a distance of more (han 
a few inches the resolution of the eye is not great eriough to 
discern the oidiviciual dots, they blend together visnally, 
forming the desired colors. 

Pen Service Station. To maximize the lite of the pens and to 
maintain oj^timum piint quality over that life, it is necessary 
to service the pens. Setviciiig includes hut is not hnvited to 
srich actions tis capping tlie ijens when not printhig so thai 
they do not dry out and wiping them on occasion to prevent 
ink biiikltrp. Tlic sendee station rnchrtles all the electrical 
and nieclrartical paits necessary to perform the servicing 
actions. In particular, it includes a motor that is used to 
act r rate actions such as wiphig and capping- Tlie motor is 
controlled by an o]>en-ioop process. 

Power Supply. A power supply is needed to prtmde energy 
to tlic printer The power supijiy accepts im ac- signal from 
a standard outlet and converts it to the dc voltages ai\d cm- 
rents iLsed to |>o\\ er the j>rinter. Bee aitse lhe HP DeskJet 
S20C will be sokl worid^dde, it is capable of riinnhig on all 
permutations of 50/{>0Hz and tl(J/220VmpLjt3 found around 
tJie world. 

Digital Electronics. The digital electronics are responsible 
for controlliiig al[ of the other e lee trtn nee han ical i)arts. The 
digital electronics genendly hiclude at least <me of each of 
the following: a microprocessor, a RDM, a DRAM or SRAM 
or botli, a block of custom logic, and an flEPROM. The 
microprocessor controls all mechanism movemenis, I/O, 



the user interface, and print data manipulation if necessary. 
Tlie RDM htjlds fumware, ajid in previous pn>dncts hut not 
in tlie HP DeskJet 820C, fonts. The volatile memoiy is nsed 
to hold firmw are variables and fjrint data and commai:ids 
that arrive ovei- the I/O p(n1,. The custom logic implements 
printer-specific functions that reiiuire hardware su|>port. 
The EEPROM holds inforn'tatirjn that must be retained 
thrxjugh a power cycle. In the HP DeMlclel S20C\ tlie micro- 
processor, the ROM, the fujstom logic, and an SRAM ai-e all 
integrated into a single ASIC: (see articles, pages 22 and 31). 

Case. The case is the pari of the printer that the customer 
sees, so every effort is made to make it attractive. The case 
includes a smdl panel of LEDs imd buttons l>y means of 
which die user imeracts with the (Jiinter. The front [lanel t»f 
tlie HP DeskJet 820C' is very simple, consisring of just two 
hutlons and three LEt)s. The case also has a door that can 
he lifted to gain access to the pens. 

Driver In addition to the physical i>aj t of the printer, all 
printer products require a software diiven which resides 
on the host computer Tlie driver allows applications soft- 
W'are running on the PC to interact with die printer. In most 
mofiem operating systerus. an application that wishes to 
print calls tlie priiUer driver duough the operating system. 
This model allows the printer mamifaciurer in supply the 
driver, soapphcation suppliers don't have to- The except it.>n 
to this model is DOS. which i"equires tlie driver be integrateti 
into the application. Because of the simplifrrations that can 
i>e made to the piinten the IIP DeskJet 820C only works with 
Windows applications, or DOS applications lunning in a 
DOS hox (see above and the article on i^age 12), 

HP DeskJet 820C Printing Sequence 

T<i begin tlie printing stHjuenct^, the user chooses Print from 
the appiopriate menu m the application. The applicabon 
lor mat s the ]>age into the standard description format nseff 
by the Windows operating system. IJsmg this foimat, the 
aptilicat ion |>asses a description of the f>age to the ] a inter 
fhiver. The driver reformats lite page mto a fonn aiDpropriate 
for sending to the printer. In the process of refonnatting the 
image, the driver peri'orms various tr'ausfonnations to map 
the imagr^ to the Inkjet |)rinting technoltJgy. In previous IIP 
iuKJet printers, the formal used to send data to the printer 
was P(!tj, a page description language, hi tlie HP DeskJet 
82t)C', tlu^ fonuat is a bitmapped image that can be used to 
fire the printheads with nunmial further transfonnations. 

Once the image is in the riglit format, data is sent to die 
pihiter over the P(.) caljle. Before the data can be printed, the 
ihiver mirst send commands to the iirinter t ixat I ell it to pre- 
pare to ].irint a page. Wlien the driver sends these com- 
mands, the printer first uncaps the pens and ser%ices them 
to prepare them for printmg. Then it picks a tiiece of iiaijer 
and advajices it to the fu'st spot where printing will tjcciir. 

After the printer is prepaied and has enough data in its local 
rnenioiy to print an entire swath, it per f onus a piinl sweep by 
moving the carriage across tlie jjage. As it moves the c:aniage, 
it pulls data out of its local memory, performs some fuial 
Ibrmatting, and uses the data to fire the print ireads at api>ro- 
priate times. Afa^r the sweep htis l.ieen completed, die printer 
advances the paper, warts for enoug!i data to print the next 
sw-ath to anive o\ er the I/O, and then, uijon cf uniuand from 
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f he rfri\ en prints the data. The process repeats for the r^t Summary 

of th^ page. At end of the page, again upon command from The aftvanre of personal computer horsepower aiid the um- 

ihe driver, the printer kicks the pa|)er. defKJsiting it in tJie fonfiiry of the Windows printing en\1ronmeni in which KP 

rnitpul tray. .Assuming that thert* iire no further pages to be has conlr<3i of lire printer driver ha\e marie it possible to 

printed. t)ie printer liten parks the carriage over the service change from the PCX printer model to a PPA printer model. 

station, caps tlie pens, and perfomts other deantip pen ser- Tlie customer benefit is that PPA printers can provide equi%'- 

\icing. The printer then waits |>aUently until the next time it alent levels of performance at a nuich lower cost. 
is called upon to print. 
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PPA Printer Software Driver Design 

The software driver for the HP DeskJet 82QC printer performs many 
functions that were formerly performed in the printer, including swath 
cutting, data formatting, and communications, The driver also includes 
a PCL emulation module for DOS applicatron support. 

by David M. Hail, Lee W. Jackson, Katrina Hc^iles, Kan^ii E. Van der Veer, 
and Thomas J. Jlalpenny 



The softw^aip flri\er fov \hv new IIP DeskJet H2iK' printer 
ineludps many r\v\\ rLmetiuiis tliat nc^ed to be peiforinefl on 
t h e h f >s1 r nni p u t e r bee a use o f tl \v pii nl e r's P ri n t i n g Pe rU ) j - 
nianre Aitiiiteclure (PPA). In older Pt L (Printer ConLrul 
Liyigiiage) i>riiiters, tiiesc functions were perfomied in tiie 
prjrtti^r. Fig. I si lows the differences. These finu (inits 
inehide: 

* Swarh currjiig 

* Data Ibrmatling 

» PPA c oni ni u n i eal i (.) i i.s 

► PtX emulation for DOS application support. 

This artirle pru\ide.s an ovemew of the ehangew net essaiy 
for siij>tH)niiig PPA and then discusses eaeh of I he riaKliuiis 
listed above in jn<in^ delail. 

Driver Overview 

Under the Windows " operating system, piinler drivers are 
responsible tor supporting a stJeeifie API t<«pi>licatiuji pro- 
gran un in J* interface) known as tlie DDI (Device Driver Inter- 
face ). This in ted ace gives the driver fairly higlvlevel drawing 
cojiniKiiuls. Ii is up In ihv driver to take those coniniMJi<ls 
tuid jnoduee a bitnici[> Huii f an be encapsulated ma language 
mid sent to the prini er. 

Typically, within a Windows printer drivei; a rendt^ring engine 
takes the Df3I conmmntLs and prockices a rendered bitmap, 
A halftoning algorithm is perfonnetl on the rendered bitmap 
and a halftoned bitmajj is |>ioduced. This halttoned liitmap 
is t>TJic.aUy m a fomial tliat eiin l>e encai>suhited in a language 
such as PCL and then given ttj (lie prinlcr. 

For the HF DeskJet 820C. Litis halftoned bitmap has to be 
put through additionid processing as shown in Fig. 1 to 
create data that is ready to be i^rinted by the printer's elec- 
tronics tlirectly. Tliis additional processing inthides swath 
cutting and sweep fonnatt ing, 

Siiice the HP DeskJet S20C does not undersfand PC'f. (Printer 
Control Language), a PCL emulation module is neeessaiy T<.) 
provide support, for DOS applications. The DOS applicatttjn 
data stream is captured by a DOS rediretlor and passed to 
the PCL emulator, wliich produces a halftoned bitinaii re^idy 
for swath cutting, 

PCL versus PPA 

Pig, 2 sho\\s the prinltJig motlcl for PCL printei^. For PCL 
printers, tiie process oreneapsulatiiig tiic htiM'toned bitmap 



is fairiy striiigbtfoi-ward. Raster (!ata frtjm the halftoned bit- 
map is cojuprcssed, PCL wrap tied, rmd then seni t(j tlie I/t) 
inothdc. The retison that this is a simple process is that PCL 
printers aio designed to receive data in the same format as 
I he halftoned bitmap. PCL printers tmwraji the data into an 
infernal laiffer' ^uid perform the necessary swalh entiing and 
dala hirmatting inlemidly. 

Fig. 3 shows the [jrinling mo<lel loi PPA pi inters. For (he IIP 
DeslcJet 820(.\ the PC.Lenca|)Sulator is ret>linefl with an SCP 
data encapsulator. SCP (Sleek Conunand Protocol J is an 
IIP~liroprietary command language. This module contains 
swath cutting functionality, data fcnnuitting^ Sf P language 
cncapsLilatioji, and printer status managetnejit. 

Raster data from the halftoned bitmap conies into the SCP 
data encapsulator goes (hrough the SCP manager, and 
eventually aiTjves at a ruster bltjck within the swath nuuiager. 
Tlie swath cuning state machine exanunes the data and de- 
tenuines the at^projiriate sweep to generate. A sweejj is a 
collection of rasters appropriate for the printer Jiiechaitism 
h) j>rint while it sweeps the printhearl over the paper. 

f )nce the sweetJ is general ed, it is given to the sweep for- 
matter The sweep formatter is responsible lor taking the 
sweep data and putting it into the appro)iriate format for tl^e 
HP DeskJet 82()C internal hardw^are. Theii the data is com- 
pressed, wrapped in SC-P, and hartdeil off to tht^ I/O layer 

The I/O layer is responsible for con mum t eating with the 
printer by wrapping the data streaju in MJnk and lEEI^ 12B4 
protocols, VXiink is an HP-proprietaty Ihik-lcvel protocol and 
IhlEE 1284 is an industry^-standard physical-layer protocol 

Performing Swath Cutting on tlie Hoat 

Swath cntting is the i>rocess of taking a page of halftoiieiit 
raster data and prfjductng sweet) riata ap[>n>priate for the 
caiTiage eleetnaiics to [ninX its the print head is sweeping 
across the i>age. Swath cutting has historically been part of 
printer finiiware, but in the HP DeskJet S20C pritaer, it is 
part of the software driver numing on t lie host computer, 
lypically. a swath manager encapsulates a swath cutting 
engine itnd receivers as inpiU a }>itniap representation of the 
I>age to be ijrinted, Tlie swath manager is resiionsible tor 
detenniniug how the pens and paper should be nio^ ed and 
when arrd how the pens slu>iild be fired to produce the 
printed page. The swath manager Jiitisi balance the often 
conflicting goals of printing widi the highest possible piiiit 
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Fig. 1. Printer driver fmictioiml block ciiagram. showiiig dltTerences 
between PCL and PPA clata parhs. 

quality and printing as fast iiS possible. The swath inaiiager 
mu^st be aware of certain printef-specific attributes such as 
printhead alignment and strategies to minimize line feed 
error In PPA, swath rnai^a^emeiit is performed on tiie host 
computer 

The process of swath cutting can be readily modeled tising 
a state maclune. Consider the example shown in Fig, 4. A 
state macliine rap able of processing this page won Id need to 
contain five stales: Top of Page, Blank Skipping, Black Text Printing* 
Color Graphic Printing, and End of Page, Thus, we can ffeate the 
state machine shown in Fig. 5. A particular instance of a 
state machine exists for each print mode tJie f>wath manager 
supports. For example f thcrr cnn Id be a print mode ffvr 



pages that only hm^e black text on them, another print mode 
for pages wit!i blac^k and colon and yet another print mode 
for pages with complex graphic images. 

As the state machine begins to examine the data on the 

page, n starts in the Top of Page state. The first data ir comes 
to is a series of blanks. This would cause it to move to the 
Blank Skipping state. During this transition the swath man^jer 
would t>pically load the page. W^hile in the Blank Skipping 
state, the swath manager would advance the paper. Next, it 
would encoimter a black text region and mov'e to the Black 
Text Pf inting Slate. Depending upon the t^pe of printing being 
done at that time, this transition may produce a sweep. 

Assume that for this print mode, the data on the page is 

being printed by making two sw^eeps for each line. Thus, in 
making the tran.sition from Blank Skipping to Btack Text Printing 
the printer could print the first pass of the black text region 
with the bottom half of the printhead, advance the paper 
half a printhead height, and then enter the Black Text Printing 
state, louring the next sweep generated, the Black Text Printing 
st^te would finish the lines thai were printed during the 
transition and continue printing the black text region 
(see Fig, f5). The data on the page wf:juld continue to be 
consumed and transitions made between states until the 
End of Page state is reached. 
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Fig. 3. PPA printuTg mode). 

Obviously, this example is a simple one. The number of 
states aiid the number of t ransitions to consume data for a 
real page caii be quire laige. Using PPA we have the oppor- 
timit>^ to perfonii tJie resource-intensive task of s^atli cut- 
ting on tiie host. This allows greater flexibility in developing 
machines v^ith unique print modes, which provides the 
oppoitiuiitj' for higlier print qualit^^' and throughput as well 
as reduced mechardsni cosL^. 
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Fig* 4. Swatii cutting s£ate itiacliirie Lransitions for a typical page. 

PPA Data Formatting 

The HP DeskJet 82(J's Printer Performance Architecture 
requires the host to perform the mE^jority of the data manip- 
ulation. The data that is sent to the printer must be in a for- 
mat that is very close to the final form used to tire the print- 
heads. The main difficulty in formatting the data for the 
printheati lies in the fact that the data doesn't come out of 
one position on tlie caiTiage mechajusm. Instead, there are 
two columns for each of the four pen colors. Each column is 
at a different vertical and horizontal offset from a relative 
zero carriage position. To minimize the cost and complexity 
of the electronics in the printer mechanism, the data sent 
from the host to the printer must be ordered so that it is 
ready to go directly into these offset printheads in the 
appropriate order so thai it is fired at the correct locations 
on the page. This ordering is based on: 

• The starting page position of each color 

• Tlie senxml arclutecture in the printer hardware (described 
later) 

• The piintliead (see Fig. 7). 




End ui Page 



Fig» 5. Strath cutting state machine. 
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Fig* 6. (a) In making the tiansition from BJanIt Skipping to Black Teiti 
Pnniing. the printer piints the Srst pass of the black lext region ^viih 
ih^ iwtiorn half of the printhead, advances the paper half a print- 
head height, and tiien enters the Black Tajrt Printing state, (b) During 
the next sweep generated ^ the Black Text Priniing state finishes the 
lilies tliat were printed during Llie transition and continues printing 
the hlack text region. 

To print a page, it Is necessary for the carriage mechanism 
to move back and forth across the page, firing drops of ink 
as it moves. Each movement of the carriage across the page 
Ls called a print sweep. Allien the driver receives a page 
to print from some application, it renders the page into a 
hallitoned bitmap. At this point, a P(l^ printer driver would 
send compressed and cricapHtilated PCL data directly to the 
printer. The PPA printer driver uses the swatli cutting state 
maciiine to generate a swath of data that can be printed by 
a shigie pass of the pen canlage. The resulting swath of data 
i^ passed on to the sweep fonnatter, which manipulates the 
data Into a buffer that can be copied directly to the print - 
heads. Tlte print sweep formatter uses knowledge of the pen 
carriage, hardware, and fimiware architecture to prepin-e 
and refonnat the data into a print sweep. 

The number of print sweeps required on a given page is 
fiepenclent upon: 

• Tlie atnount of data on the page (text or dense graphics) 

• The print ntode selected by the user fbest, normal, or 
cconofast) 

• The paper type (ptain, glossy, transparency, or special), 

l\iT ea<ii j)rint sweep, the host sends two pieces of informa- 
tion to 1 lie printer The first is the PR!MT_SWEEP data, a buffer 
of iniiige data sent l>efore the PRINT_SWEEP conunand, whicii 
contains an entire sweep of swing buffer data blocks in 
the correct order. The second piece of info im at ion is the 
PRINT^ SWEEP command, the mechajtisrn t>y wixich the driver 
tells the printer where and how to plate I lie piinl sweep 
tlata oit the page. A PR !NT_ SWEEP command contiiins niini- 
mtim iiiKl maximum positions for each pen column, the 

Paper in PrintSf 



print direction, print ^eeds, and N£XT_PfilftfT_SWE£P informa- 
tion. 

The PRINT_SW£EP command information is calculated by the 
printer driver based upon: 

• Which pens are acdve (Ijlack, cyan, magenta, yellow) 

• The starting and ending locations on the page for each pen 
color 

• The direction of the print sweep 

• The serv"ant architecture: 

The distances between jK^ns 
^ The distances betwe<?n odd and e\'en colimms within a 

pen 
c The 0,0 position in relation to the pen columns. 

Servant Architecture 

The sen"ant hardware fsee article, page 31) Ls composed of 
a pair of buffers, called suing buXfers, for each cohmin of 
the prinihead (two columns per color). To btiOd a print 
sweep, tlte driver must: 

• Separate the image into CMY planes, or primitive data 
blocks 

• Separate the primitive data blocks into swing buffer data 
blocks 

• Order the suing buffer data blocks into a servant image. 

A printitive data block (a bitmap image of each plane for 
each color) is created by tlte driver Each primitive data 
block needs to be split into two separate swmg buffer data 
blocks: an tsdd block and an even block. Tins is necessary 
becatise of the pen design, which consists of two oflset 
coltmuis, as pictured in Fig. 8. 

Each column on the color pen has Z2 nozzles. The color pen 
has a height of 64/300 inch. For any given colunin of data, 
rows 1» 3, 5, ...1 6^^ will he part of the odd coknnn and rows 2, 
4, 0, ..., 64 will be part of the even column. 

The even and odd swing buffer data blocks are each 8 bits 
wide, the width of servattt liAM, and each is the height of a 
printhead nozzle column. Swing buffer data blocks are cut 
for each primitive color and for either the even or odd 
nozzle coluntn. Thus, each swing buffer data block contains 
every other row from Ute primitive data block. 
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Fig. 7. MP DeffkJfit S2(}(' print cartridge layrjut . The lines cornsspuru! 
to iio/./Je columns and their general conllgurarioii on the \mnwv 
carriage. 
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Fig. 8. Each color pen has two offset colunms of rtozzfes. 
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Fig. 9. Pnmltlve dati block or^iariizatioji for a printhead that 
tiajs two colunms of six no'ZvAes per color. Byle n (n — 0^ 1. 2, 3, 
4, 5) is a btiffer of data 8 pixels wido by 6 rows (tiozzlesj high. 
The HP DeskJet 820C printii^ads have two 32-riozzie columns 

per coIf>r, a.s shov^^l in Fig. 8. 

Fig. 9 sfiows a siniplified example of a priit\itive data block. 
Each byte is a buffer of data that is one byie (B pixels) wide 
by K rows high, where N Ls the n timber of noz/Aes> in a print- 
head toluiun. For the example in Fig. 9, N is 6, while N is 32 
for the HP DeskJet 820C color priiitheads. 

Each column of the primitive data block in Fig- 9 is divided 
into four swing buffer data blocks with bytes relocated to 
the positions shown in Fig. 10. Only the cyan pen is shown ^ 
and only tw o of the swing buffer data blocks for each col- 
umn of Fig. 9 are shown. The drawing W'ould be similar- for 
the nmgenta ai\d yellow pens. 

Once the data is in the form of even and odd swing buffer 
data blocks, the blocks must be ordered and sent to the 
printer. This ordering is done with knowledge of the column 
spacing on the printhead iuvd knowledge of the order ui 
which tlic sGr\^ant architecture will retiiiire the data. The 
printer driver controls the order in which the eolumiis will 
trigger via fields m the PRINT_SWEEP comni^uid. The ordered 
swing buffer data blocks are then sent do\Mi as PR1NT_SWEEP 
data ready to be loaded into the primitive swing buffere in 
tlie pri Jit head. 
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C0:)( = Cyan Odd Printhead Golumn! Primitive Data Block # 
CE:k = Cyan Even Printhead Column: PrimitivB Data Block # 

Fig* 10* Suing buffer data blocks for the example priitutive data 
block shown in Fig. 9. 



Each primitive swing buffer consists? of two 8- bit columns, 
separated liy a swing friggcT point. While the senant print 
process is miloading one side of the odd column swing l^ufTer, 
tlie other side of tlie odd column swing buffer is being loaded 
by the sei'vant load process. Once the byte is loaded, the 
servant print process fires one bit by 32 rows at a time for 
each pen column in the color pen. When the ser\'ant print 
process has imloaded ail eight bits, it crosses a swing trigger 
point, and the servant print process switches to the other 
swing buffer and triggers the sen^ant load process to load the 
empty swing buffer. Tlie pen iires one bit by 32 rows at a rtme 
for each pen coiimui. The sen^imt (printer) is responsible for 
any complexity hwolved below- the byte level. 

Wlien all of the sw^ing buffer data blocks have been con- 
sumed by die printhead. the carriage mechanism uses the 
NEXT_PRINT_SWEEP information to position itself for the start 
of the next print sweep. 

Becatise tlie PPA printer relies upon the driver to format the 
data appropriately, the ^u'chitecture does not require the 
printer finnw^are to havc^ any knowledge of the operations 
just described. Thus, the cost and complexity of the elec- 
tronics hi the printer mechanism are significantly reduced. 

PPA Communication 

One of the goals of the IIP DeskJet 820C printer is to pro- 
vide continuous feedback to the user during any printing 
operation, mid to guide the user during problem sohing. To 
accon^t>l's^i ^J^LSj the driver requires a mechanism to ask the 
printer for information atui to allow the lainier to iiutily' the 
driver wiienever something happens (lite pri tiler ih> out of 
paper, the user opened the cover, etc.). The mechariism used 
by the PPA driver to commmiicate with the printer is called 
status messaging. 

To notify the user to align the print cariridges when a print 
cartridge has been changed, that the top cover is open, or 
that something else needs attention, a bidirectional Imk with 
the printer Ls required. Tw^o ncw^ HP-proprietary protocols 
allow^ the driver to communicate bidirectionally with the HP 
DeskJet S2()C: VLink packet protocol and Sleek Command 
Protocol (SCP). Previous HP DeskJet printers used an I/O 
packetizing protocol called MLC (Multiple Logical Ch^uinel) 
and a proprietary HP printer conunand protocof For PPA, 
\T^mk replaces MLC\ and SCP replaces both PCL and the old 
printer command protocoL 

While giving users error messages might seem to be a luxtuy 
they could do without, the real reason to have a protocol 
like \T^ink is that it is useful to figure out what is wrong 
wiien, for example, die printer's input buffer fills up, the 
printer stops accepting data, and the host is unable to send 
even one more b^te. Tills often happens and is temporiuy. 
but in the days before bidu'cctional protocols, the d2i\Tr 
w oiild sometimes wiiit ^md wait to be allow ed to send again, 
and it didn't know whether the delay was because the top 
cover had been opened, a print cailridge liad fttiled. or a 
fatal eiTor had occurred. It is helpful to know whether to 
abon the job or ask the user to insert a print cartiidge or 
close the door. Witli a bidirectional protocol, the printer tells 
the driv^er exactly w^hat the problem is, and tlie driver can 
decide what action to take next. 



1 6 June 1 f ?P7 He wleti- Packard iloumal 



)Copr. 1949-1998 Hewlett-Packard Co. 



Ff oth Orfver 
FrQiii End 



PPA DrrvcT &Bti Path 
Data EncapsvUtor 



SCP M;inai^e} 



HP Ocskjel 



I/O Mansgef 




Fig, 11. 1'l^A Status n^ssagmg archiiecture. 



Daia that is sent bj- the printer such as notifications that 
something is wTong, are put in the printer s outpyi buffer. 
The cJriver spawns a hidden executable at the beginning of 
each print job called the jmii suiffeK wliich checks the port 

e\'er>" iiatf second to deiennit>e if the printer has sent any 
daia- If so, the data is routed t^irough the IEEE 12S4 layer 
to the \Xink layer, which then posts a message to the I/O 
manager s hidden status window. 

The status window uses a callback to caD into the SCP nmn- 

ager. whicli translates the status information, and if the mes- 
sage is something that should be dis[>layed to the user, puts 
it on the event list. The event list prioritizes the messages on 
it so that the mc^t important message gets sent to the HP 
Tboibox, which displays the dialog box to the user If the 
message is nn error, it may get resolved (for e\:ample, the 
user puts paper in the printer and pres^s t)ie Resume button)* 
The message is then routed up through the same path and 
deleted from the event list. Tlie Toolbox takes the dialog box 
down ai^d displays tlie next mosi imjjortant message, if 
there is one. 

Internal Object** in PPA Status Messaging 
PPA status niessa^g involves severiil higli-level modides 
and objects: tlie SCP (Sleek Command Protocol) managen 
The I/O manager, the VLink module, and the event list (see 

Fig. 121 

SCP Translator. The function of the SCP translator object in 
the SCP manager is to encode data mto the SCP format and 
flecfxie messages received in the SCP format from Ihe printer 
iixto quei^y' rephes and event iiUl>rmation. The SCP translator 



A bidirectional link is not required for printing or to have 
limited status feedback from the printer However, uollke 
P( 'L pii[ Iters, which can accept either PCL data wraptn^d in 
MEjC or raw PCL data, PPA printers can only interjjret data 
vvrat^iJetl in VLink and SCP. Thus, while MIX! is mi option 
I hat i'<m be added when a bidirectional link exists, \Titik 
n iiLsf handle printing with and without a bidirectional link as 
well as printijig to a file. 

Based on VlJnk's chartnelization features, there are two 
pat lis the data can take to the printer One is for image data 
(the dots that will g(j on the page), and the other is for com- 
mtUTd data. ( 'onunmul data includes commands sent to the 
|)riiiter, sucli as "Print this sweep," requests (or information, 
or queries, such as "What pilnt cailridges iire histaJled?'^, 
and status infomiation, termed aittostatuSf such as '^The top 
cover is open." Sending image data is easy from an I/O 
standi>oiiU — if the printer has room in its i)uffer. the driver 
will send the data. Since com man fl data nnisf be sent and 
also received (autostatus tnay c^ome in at any time), it is by 
nature a more complex affair. 

As shown in Pig. 11, data that comes in from the front €'nd 
of the {1ri%Tr goes thrntigh fhe data encapsulator, like PCX 
printer drivers, but from there it goes through several ru^w 
objects. The SCP m imager wraps (he data in SCP mid sends 
it lo the VO manager, which provides an interface to the 
datacomni ol>jects. The \Tink layer wraps the data In the 
VLijik protocol and sends it lo Ihe IEEE 1284 layer and out 
to the printer 
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Fig. 13. S('f* coniiTiarni forniat. 

does not send SCP datii (iireriiy \n i\w I/O majiager, hIik e 
memory managenieni for the data buflVi's is done in the SCP 
translator's rljenfs. wliifh are tlie swath manager and the 
status nianagpi\ The tiient of tlie SCP iranslaicjr passes in a 
pointer to the ciata. an t»m|)ty huifer, and ilie maxinumi ciata 
length, Unce the data has been paekagod, if the SCP transla- 
tor llnds that the data iB larger tliaii the buffer, il will return 
an eiTon Ot hei"v^isc\ it will pass t>a<: k the at tual SCT data 
length. The goal in designing thc^ St P triuislator was to en- 
capsulat e the Sleek Commmid Protoetjl s(j tiiat ehajiges in 
SCP in tJie finnwait^ aiTet t ciienLs oi' this module a.s little as 
possible. 

Commands in SCP use the format shown in Fig;. 13. llic 
ronniiund s[H'<*i[kT field identifies the SCP command. The 
length field inditates the number of b\4es in tlie data field. 
The data field does not exist for every command. 

Priodties. Prionties idlow the printer to execute commands 
in a dilferL^nt ordur than received. This may be necessary 
when a eominand cimnot comi>lete execaition mid it is desir- 
iih\v lor the prinler to ijrocess queries so the tirivcT ean tlnd 
oii( wliai tlie problem is. Priority levels are defined in the 
SCP translator' and the clients can set wtiatever priorities 
they like. Stmidard in iorily levels are defined as shown in 
Tal)le 1. 



Table I 




Command Priorities 


Command 


Priority 


Printing Commands 


Low 


Queries 


Medium 


Initiahssiiig and 1 ieinitializmg 


High 


the I/O Link 




Recovei irig from Eitoi's 


Reco\er 


Canceling 


Cancel 


Restaithig ttie Pruiter 


Restart 



It is assiunpd that the swath manager will send all of its 
printing conmuinds (LOAO_MEOIA. PRIMT_SWEEP. EJECT_MEDIA) 
at the lowest priority. Any gueries it needs to make wiH ciill 
into the status manager. All queries should be at the same 
jiriority aiul Iiigher thmi priJifiug commands. It Js up In llie 
citertts I o set priorities. 

Status Manager. The status manager manages messages to 
and fioju the printer. These messages can be broken into 
two categories: events and queries. EveiUs are imsolicitcd 
nofiiications by the printer (i.e.» autostatus) that something 
has occurred tu change the state of rhe fuinter. such as "the 
door is open." Queries iiie requests lor infonnation made by 
rhe driver to the printer, such as the pen IDs of tlie installed 
pens. The status manager traeks the state of the pnntei- ^uid 
creates events when state changes occur. For example, when 



the Resume lint I on is [^res.HCfL an intenial stale < iuitige occurs. 
Tliis .stale cliajige is reeugnized by the stal iis manager and 
reported a^ an event to tlie event translator 

Wlien the status manager receives Jiotification of an event, it 
determines what has changed and whether the event is some- 
thing die event translator hits oHiuesled to kiiow^ about. If il 
is, a callback in the event translator is culled. 

Upon stariing a prini Jolt, the status manager querk^s the 
printer to get the cur re n I state of events. No event iinlifK na- 
tion will be received until ^tn eveni ot^euis in the printer. 

Event Translator This module exists IxHween the event list, 
v^hieh is Windows-specific, and the status manager. The 
event translator translates the bit-field data, wliich is re- 
turned to the status manager by the printer in autostatus, 
into events. New events aie added to the event list by the 
status manager, ;ind events tliat me no kmger valid (e.g., tlie 
door was open fiut tiie iisei" shut it) are removed from the 
list. Tlie event list orders die events reported to il ac t'4>rdhig 
to their import mire to the user, and tells the status rTionilor 
winch dialog box to display. From nmst important ( 1 J to 
least important (lO), the following event priorities are used: 
(1) 1/0 errors. (2) paper Jam. caniage stall, or maximum 
theniial limit, (3) pen faihire. f 4) wrong iien, (5) low or out 
of ink, (d) pen missing, (7) nut nf puper. (8) ecwer o|>en, 
(^y) diy timer, (10) new jjen. 

I/O Manager. This module is intended to glue the VLink mod- 
ule, which is Windows-specific, to the St.T manager, which 
is shared. Hmidling for events, tt^i cries, and liuffer manage- 
ment must be pt^rfonned by the 1/0 manager in adthtion to 
sendmg data to the jjrinter ius quickly as possible. 

Events. The 1/0 mattager creates a hidden wdndow so that 
wlien tlie printer sends unsolicited event notification, Win- 
dows messages to that effect can be posted to tliis vrindow 
by the \Tink mt>dnle. When the I/O mmiager proct^sses this 
window niessagt\ it will read the SCP data! suffered l>y 
VLink and call a callback in the slatiis manager, passing in 
rhe SCP data. 

Queries. To get replies to queries, the inqidring module calls 
VLmk, specifying a buffer in which to place the reply. M^ink 
checks this queiy reply buffer to see if anything hiis been 
re tu n \ e<^i i n respons e t o t he qi le r> . I f so , i i i n 1 1 n ed i 9 1 e \y re- 
tums widi I he SCP data. If nol. it polls the inromhig chiumels 
ff jr a spec i tied timeout jjeriotl (o aUempT to retileve the reply. 
If a reply is received before the timeout period expires, Uie 
SCI* data is [>assed through to the status manager. 

Datacomm Paths. The image and command datacomm paths 
send data tu I he piinter lis long as ihere is space in the buffer. 
If space ntus out, the rommmKl dat.acoiuni pash waits until 
moiv space l>ecomes a^^ailable. The image data is hantlled 
dd'ferently. If space iims out while sending image data, the 
iniage datacomm path returits to the caller. alUjwing it to 
render more swat lis mitil more space becomes free in the 
printer. 

VLink. The \^Unk modi he must package data iti a protocol 
the printer recognizes, mid send only as much data as die 
printer can ta.ke, as quickly as possible, \link must also 
miwTap data from die printer mid route the messages to the 
appropriate chenls. 
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T\\e VUnk protocol replaces MLC (Multiple Logical Charmels) 
for the HP DeskJet 820C\ Lake MLC, \liiik s intern is to pro- 
vide a way for the host and the periphefal to exchange data. 
Unlike MLC, Vlink is not optional- All data going to the 
printer must be wTapped in its protocoL In addition. Mink is 
streamlined or '^sleek,' and doesn't have many of MIX? s fea- 
tures. MLC supported multiple logical channels, while VLink 
supports two outgoing and three incoming channels. 

Outgoing Channels. The printer accepts data in either its input 
buffer or its conmiand buffer The \link module specifies 
which type of data it is sending through a field in the Vlink 
packet header. A template of a Vlink packet is shown in 
Fig. 14. 

Image data is sent to the printer's Lr^put l>uffer on tlie image 
data output channel. Comtuands and queries are sent to the 
coiiunand buffer on the command data output chaimeL 

liTCommg Channels. Since a bidirectionaJ link Ciinnot be guar- 
imtced, till incoming data is optional. Tliis is necessary fur 
file dumps and bad cables, and miscellaneous comniuiiica- 
tion problems. 

The printer periodtcaJly notilxes the host how much buffer 
space is left in the printer. This is known as ci^edit, and the 
printer sends not ifi cation for botli the command and input 
buffers on the credit input chamiel. The \Tiiik module will 
not send more data than the available credit, 

Vlink accepts two types of data packets from the printer in 
addition to credit packets: qKiery replies, which are expected 
on the status input channel, and a collection of bundled 
items regarding printer status (such as out of paper), called 
aulostatus messages. Autostatus messages ultimately map 
to events. 

An autostatus message from the printer consists of a bit 
collection of several long words representing tfie current 
state of the printer. For example, when tfie door is opened, 
the door open bit in the collection is set. to true. A ret>or1 is 
generated on the autostatus input cbEumel when any of 
these bits are toggled. 

Wlien the Vlink layer receives some data^ the data is identi- 
fied ivi eittier credit, a query reply, or an autostattis message. 
Credit is interpreted and liandled witliin tJie \Ttink module. 
A query rejily or an autostatus message is buffered IntemaUy 
so that tlie clients ran read it later. 

If a received message is an autostatus message, the VLmk 
layer posts a W'uidows message to the I/O n liuiager indicating 
thai an autostatus message is waiting to be read, UTien the 
L/Q manager processt^s the Windows message, it reads the 
buffenvi autrjstatus message. Posting a message is necess^uy 
so that VLir\k can be free to poll die data lines for more 
incoming data from the printer 

Once the buffered nit\ssage has been read, it is <Jeleted. Only 
one query reply and one autostatus message can be buffered 
at a time. If a new message comes in before the original 




message can be read, the new mc^ssage replaces the old one. 
It is for this reason that no additional printer queries shoidd 
be made while waiting for a reply. No harm is done if a new 
autostatus message overwrites the old message because 
the same information is contained in each message and the 
newest message is the most relex^anL 

FCL Emulation for DOS ApplJeatlon Support 

The development perioti of the IIP DeskJet B2()C coincided 
with most users rafxidly transitioning aw^y from DOS appli- 
cations towards windows applications. UTiile we expected 
that most users w^ould tise the printer in its optimized design 
center, we recognized that we needed an adequate bridge to 
the few DOS applications that would continue to be used. 

The IIP DeskJet 550C printer w^as the final printer to be sup- 
ported by mosi DOS applications, so the solution had to be 
functionally compatible with this printer and provide equally 
good print quality We chose to prrmde compatibilitTr' v\rith 
the HP DeskJet 66UC printer, which was a contemporaiy' 
printer that saUstle*l these requirements and provided an 
internal interface that enabled us to separate the PCL per- 
sonatity from the printer engine fimiw^are. We planned to 
pori tlve PCL personality functions to the IIP DeskJet 82 DC 
printer dilver, encapsulating them in a PCL einulalor module. 
The required printer-engine fimctions would then be supplied 
by the rest of tlie HP DeskJet 820C driver in this way, we 
could minimize design changes and nuLximize the chances of 
identical eompatibilit^^ If a DtJS application is run from an 
MS-DOS prompt vtindow. also referretj to as a DOS h<M\ the 
printer driver can iittercept ttie PCL data stream that the 
DOS application sends to the PC's parallel port and redirect 
the data stream to the PCL emulator 

The HP DeskJet B20C PCL emulator encapsulates the HP 
Desk-let fH^OC " Jonriatter ami text engine code. The design of 
tile HP Desklet (M)C iinnware was such thai all interfacing 
to t he external mechanism was done through a well-defined 
AI^I inten^ally known as the Ed I nlejf ace (see Big. 15). 

The Ed Interface resides between the formatter and font 
manager and tiie rest of tlic Onnware. It is a collection of 
iunc^tion calls to die support code in the firmware. Since we 
reused the formatter and font nuinager cf>de, we provided 
the equivalent firmware functionality by mapping t:he Ed 
Interface caUs into HP DeskJet 82QC' support objects. 

The fimctions of the formatter and text engine firmware 
code w^ere written in C, mui as such m-c fmictions in the PCL 
emulator appbcation (Fig. 16). The PCL emulator appiica- 
tion provides C++ objects that encapsulate the functionality 
expected by the Ed Interface. 

The PCL emulator application is designed to receive a file 
name f.hat contains the P(]L data to ojierate on. Interfacing 
between the internal PCL emulator object and the external 
driver is provided through a PCL personality object. 

The PCL emulator is implentented as an executable applica- 
tion becatise the original tlrmware code expects to be a sep- 
arate task, and tliis implementation allows almost direct 
porting of the HP DeskJet 660C firmware code. The FCL 
personality pro\ides the haiuller fimctions and the extemtil 
interface for receiving tiie PCL file name. 



Fig. 14, V\Ank packet format. 
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Ed Cr&aic Poaf 
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Ed D^fitiill Qunry 

Ed Ddiiide Ufid^M^} 

Sfi m Mffm Updnto 

Ed Fdiiii F'trnd WHiliny 

Ed Fas Request 

Id En or Trap 
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Road Nifil Entity 



Fig. 15, PCL enuilation is pro^idetJ iii ihe HP DeslcJet SMC piinter 
by mapping the exist uig Ed Inietface calls to DeskJet S20C support 
objects. 



To aOow DOS apptications to print to lite HP DeskJet 820C, 
it is necessary to capture the tiata generated by the DOS 
applications. Tliis process is referred to as DOS box rediwc- 
NoiL Essentially, it Ls necessar>' to capture tJie bytes intended 
for the jDarallel port and put them into a file so tliat tlie PCL 
emulator can properly interpret the data. 

Under Windows SA, DOS box redirection is not part of the 
operating system, so it was iiecessai^ for us to provide a 
redirection stjlul ion. This functionality is provided by a 
redirector VxD ( virtiiaJ cie\ice driver), a redirector DLL 
(dynamic Unk Ubrary), and a redirector EXE (exectttable), 
as shown in Fig. 17. These three pieces captnre the data 
stream and put it into a temporary file. This file is then hand- 
ed to the driver, and the driver hands it to the PCL emulat^on 

Under Wmdows 95 (Fig. 18), DOS box redirection is provided 
by the Wmdows printing system, so our redirector solution 
is not neeessEuy for spoohng to work under Windows 95. 
PCL printers essentially get DOS box re tli reel ion free. PPA 
printei^s need to intercept and perform PCL emulation on 
the DOS data stream. Microsoft provides a replaceable mod- 
ule called a language monitor where the data stream caii be 
intercepted. The language monilor is a :J2-bit DLL called 
directly by the spooling subsystem. The language monitor 
takes the incoming Iniffers, writes tliem tcj a temporal^ file, 
and passes the file name to the driver. 

Porting the Firmware 

Tlte process of porting the C-laiigtiage ieiife feom the HP 
DeskJet 66(JC presented several challenges. The original 
tirmware was developed for a Motorola (BtJOd processor, 
wliile the printer driver runs on the hu.el 80x86 processor 
in Windows U>bit mode. 

These two hardware platforms have conflicting ways of 
addressing memory for data types larger th<m a byte — the 
fonner is big endian (the most significant byte comes first) 
and the latter Ls little endian. As long as a data element is 
consistently accessed with the same data t>pe, there is no 
problem. However, there are places in w^hich a data type is 
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Fig. 16. The PCL emulator appU- 
cation provides C++ objects that 
eiicapsLLlate the fiOK^tloiiality 
expected by the Ed Interfaoe, 
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vmiten as several singJe b\ies. fhen read as 2-b>te or 4-b\ie 
t|tiaiitities. \VV net*dect lo idetUify and ciiaiige tht* code in 
thest* places. 

Tlie original font data that d^^ribed the glj^ph (sliape) iiifor- 
nialion for the text engme was a single block of 250K b>tes 
of read-only data, ThLs block was inai.ii>ed lo five blocks of 
resound data, since each block liad to be le^ than tUK 
t>>le^i for \rmdows HRii! nitMJp. Ttiese blrw^ks arc discard- 
able* nieaiimg that tiie operaiiiig systeni can load ihc^ni when 
it needs in read some data. l)ul to load other co<ie or re- 
source blocks when Windows lias mn out of tnemorj', they 
can be replaced by other blocks. 

The origii^al fimtware s text engine depended on a special 
liiii'dware component that rotated font gl\pli data from hori- 
zontal to wrticai oriental ion. could double the size of the 
(liita. ;iiui smoothed I lie etlge.s of a glyph using several rules 
for HP Resolution Erihaocenteut technology (REr). Since 
! his hardware %vas not available to the printer driver, w^e 
were able to sinuilate the first luni second of these functions 
in softvvaie. We (letermined that the ])rint tjtialit\ wnuld hIIU 
he better liian the HP DeskJet TwOC even if we did not sinui- 
late the REt lilies. Ttie resiilthig software sintulation exectttJt^s 
more slowly, but the orginal firmware design included a font 
cache, wliich minimizes the the nimilier of liuies llmt we need 
to execute this function. 

Some further syntax modifications were necessary/The 
)irinter driver is capable of supjioiting more thati one of the 
same piliiterj for example, a printer on the LPTl port and 
atuither on tlie LPT2 ]:iort, and these printers can be priming 
al the same time. For Windows to l>e alile lo execMUe nmltijjle 
mstances of the PCL emulator, the ctxte nuist be compiled in 
the Windows r}H'diinu-memo)y tnadf'L This required Ihal 
nitmy t '-language pointer \'jiriaJjles be iU'ir^igivdUHifarpohifns 
rather than the muiv c^fllcient nearpoinleni. Also, some 
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TiK. 17. DOS box rrditf'c tion fm Windows :iL 



Fig. 18. 1 H JS liijx redirocUiin far Wlndenvs JJIS. 

hiublle syntax eon-ertion was necess^iry because an integer 
data type is 32 bits for the 680D0, but 16 bits for the 80xB6. 

Tlie PCL enuilaiif>n irnplenu^ntation \^as accompUwfied in a 
staged tleveloptnent [jrocess. Two months before the nrst 
piinter driver comj)onents to support the IIP DeskJet 820C 
beeatne a%'ailaljle, we were able to build a t>( )S aiiplieation 
that wa.s lotally de<*oupled from a printer tiiivt^r. It wotdil 
act ept a test hiijot stream of PCL data and niaij the input to 
;in (mt|iut file of riister data, which eouki he pruited on the 
HP DeskJiM 8r>0C, which was ntec^hanieally Identical to the 
target IIP Desk^Iet 82t)C. losing our test center's extensive 
suite of mput test files, we weie able to stabilize the porting 
implementation, within the limitJSof the [)()S apphcalion. 
P'or exanuile, we ntiticed tJuit the DOS uiennjiy allocatu>n 
algoriihin would fragnienl meinoiy that was being t rinliiuuiily 
allocated mid (rei'd, so I hat eveiUnally :i inenit>iy allocation 
recjiiest would I'aiL nowe%er, when we niov(^<I nu to a siil^se- 
queni, stage in which we depended t)n the Windows menioiy 
manager, w-^e foutifi tltat this menioiy fragmentation no lot^ger 
occiuTed , ( ) I ice the I >() S p< ) n w lls ^\ at» i 1 i zed , we i n i eg ratet 1 
the IVh personality into the prinh^' driver, using ihe HP 
DeskJet 850L^ output target patit, while still iJioviihog an 
input file of PCL Next we introduced and stabilized the DOS 
redirectxn^ input [rath. Wlieii Ihe HP fJesk^Iet H2i\C output 
target i>uth finully heraine avjulable. we were able to switvii 
to it cleanly, mid the PC L eunilafor became an efh^ctive tool 
to help st^ibilisce the new output target path. Finally, we com- 
pleted the target functionality, always buikhngupon a stable 
base. 

To sunnuurlKe, by n^nsiiig original fimiware code we were 
able u> proviilc identical Pt L hnicticjuality tor PPA printers. 
Providing support for the Ed interlace API allowed the linn- 
ware eode to be reused with little design mod ifi cat ion. 

Windows JS a U,S registered iratfemarit of Microsoft Corporal™, 
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PPA Printer Firmware Design 



Hewlett-Packard s new Printing Performance Architecture (PPA) includes a 
significantly reduced set of printer firmware. ''Don't touch the dots" was 
the firmware designer's golden rule. This means that the firmware and 
processor do only mechanism control, I/O, command parsing, status 

reporting, user interface, and general housekeeping functions, 

by Erik KHk 



A significant hivUw m Hewlett -Packard's new PrinLiiig Per- 
fomiaiK'e Architettiue (see aiticlp, page 6) is the leciuetion 
of the processing power em be tided in the piiiiler. Using the 
hosl PC for all image formatting leaves only motor^ print 
cartridge, I/O, user interface, command, ^^d status funetioiis 
to t)e controlled by the linnware. This results in signitieant 
cost savings by reducing processor needs and t)y reducing 
R(>M and RMl iec|inrements. Tlie goal, which wm achieved, 
was to reduce tlie ROM requirements to 64K bytes. 

Fig. 1 shows the Iraditional firmware architecture used in 
HP DeskJet printers. The firmware receives from tlie host 
PC a combination of text, text formatting commands, and 
tuster graphics data. Tliis is formatted according to the 
Ht*w let! -Packard PCJL printer language specifieation. The 
inRinualion to print lurives at a page descripfion level, 
wtiich requires firmware to rasterize a bit image, generate 
anfi pla<^e fouls* antl format and cut the image into swaths 
according to tlie retiuirements and farnuit of the print 
cartridge. 

At tlie VO tayer, previous IIP DeskJet printers make use of 
the Muhiple bogical Channel packetizing layer (MLC, being 
proposed as l^^EE stancUml 1284.4) to orter uuilliple cf.nuiee- 
tions between a host and a iirinter. I^CL au<i an MP propri- 
etary peripheral status laiiguage share the bidirectional 
parallel port, 

Tiie rasteri:aing step invtilves converting text and text for- 
matting c<>mmands uil(j a gratJhicat l>it image to be printed. 



Separate bit image planeis are created for each of the foiu* 
ink colors: Eilack, cyan, magent^i, and yellow. 

The swath cutting step involves cutting the bit linage into 
print-cariridge-higli swHtlus. performing image enhance- 
ments such as overlapping print sweeps* and at yu sting tJic 
bit-image phiues to the particiikir format retjiiired t>y tiie 
print cartridges used iti the printers, 

In general, not cmly does the traditional TIP DeskJet firm- 
ware consisi of more modules l>ut the modides themselves 
are considerably more complex than with the lu^w Printing 
Fc ri'om lanc e Aj chitect ure . 

PPA Firmware Architecture Overview 

The primaiy goal of ihe Printing Performance Aichitecture, 
or PPA, is to reduce the price of an HP UeskJer printer while 
maintaining or increasing print performance. The digital 
elettronics poition of this savings is accomijlished by reduc- 
ing ROM, RAM. ami miciopiocess^jr costs. ROM is reduced 
Ijy moving the rasterisation, font, and swath module fuuc- 
t ioi^s onto the host's printer driver, and by using streamlined 
1/0 and command language f)rotocols. RAM is reduced by 
requiring only «^ no ugh RAM for a worse-case print sweep 
|)lus sf^are Ra\.M for mi ware overhead. Microprocessor 
costs aie held do mi by reducing tlie processing, m particular 
the data processing, reqmi'ed of the microprocessor. The 
"don't touch the dots" ccjneept enabled the use of a low-c^ost 
6BO0O processor 




Fig. 1. TratlJtional HP Desklm 
printer fininviire ariihitt^clun*. 
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Fig. 2. HI* DeskJet ^iHZ printer finiivvjire architecture overview. 

Fig. 2 shows the fimiware architecture of the HP DeskJet 
S20C. [t coRsisis f>f asmiiJ] set of conimunicating riKMluIey. 
Ench mofhile is miplementiHl witli a few eoinintjnic aiiug 
processes and intemipt service routines. Tlie proce.ss<*s 
conimunic:ate through tJie use of n Messages (see below). 

The UO Tnothjle receives data atid rommancls from the host 
PC, passing riiern on to the coninianti module, and traustuius 
responses and status infonnation back to the [K\ The rotn- 
niand mcKiuJe parses and prioritizes the incouiing coruiuaads 
and passes them on to tlie other modules, most, often the 
mechanism module, for execution. The mechanism module 
receives paper load and eject. priiU sweep, and prim cm - 
tridge servicing conmiands, ]j(^rt'onns the requested actions 
by controlling the motors and print can ridges, and i>asses 
the results back to the c;ommand module. The U/l (user 
intetface) moduh^ !i an dies tlie froni -panel state tnacliine, 
sending comtnii 1 1 ds (o die eoruinaiid module as necessaiy 
Tlie st.aius module mnniiors live (>riniers stattis, coumuini' 
eates this status hack to llie VV via tlie 1/U motlule, mid 
keeps the rest of the modules informed of system status. 

Procei*j*es, Messages, aiid Operating System 

A smaJl and ellicient custom operating system manages the 
t^xecution of nmlliple processt^s and the delUei-y of mes- 
sages from one iirocess to ancHlier. The ope tali ug system 
alst) provides suptjorl fov infernifif sei"vire routitu^s. rh^laytnl 
j}rociHlur(^ I'aJls, and binaiy semttphoies. 

Pracesses. Multiple, cooperating iiitiependeut threatls of 
execution veiled pn}crsses are used to provide priority, tnod- 
ularity, and f*arallelism within the PPA finnware aiuiiitec- 
tare. Indivldnal | processes avv instantiated with a fiuicUon 
stack, a fixed prioiiiy of execution, and a specine set of 
liroadeasi classes. The highest prioiity reatly process exe- 
cutes until eitlier a higher-priority process becomes ready to 
execute, the current process is blocked waiting for a new 
nies'sage, or the |)rocess is l>locketl waiting h*r a semapfiore 
to be unlocked. The prcK-ess's luMadeast classes in<iicaie 
which set of broadcast mt^ssages tiie proc^ess wauls to re- 
c*eive, Pro(*esses are static anci never terminate, 

A fundamental architectural concept is that there is a one- 
to-one c(UTcst)oiiflervce btlvveeri a process and a tiicssage 
(lueue. In oilier W(U'ds, each mid every prtjcess lias ils own 
qtieue for messages mid no (>ther queue. This ctmct^pt is 
Itardwired into the systent There m*e uo facilities for the 
creatttm or use of any other message queues. When a pro- 
cess nHiiic'sts a message, its eoun xl defines which queue is 
selected, 



The PPA firmware desi^ is rather litieral with the use of 

procc^sses to both modularize ktrnl parallelize functionality. 
Table I shows the eigtUeeti i>rtK-ej5^es used in tlie IIP Desk- 
Jet S20r printer. 

Tdiile I 
Rrmware Processes in tlie HP DeskJet 620C Printer 

Major Firmware Module 



m 


Camntand 


Mechanism 


Status 


m 


Otiisr 


[Q 


Parser 


Mechanism 
State MachfTie 


Autostatus 


Ul 


PState 


IEEE 12B4 


Pacer 


Walker/ 
Dispatclier 


Status 
Request 




CDnfig- 
y ration 


VLink 
Pacing 


£xe cuter 








WV 
RAM 

Execute 
Data 

Test 

Print 

Simple 



Messages, Me,ssages form tlie fundamental communication 
methofi betwt^en processes, Physjeally, messages are fixe^l- 
size, small l>U>c*ks of memoiy They contain both required 
and optional fields, 

Tlie typical life of a message is as follows. A process ac- 
quires an uninitializerl ines,sage from the operating system, 
Tlie proct^ss filis liit^ necessmy message fields, llw message 
Ls posted tfj anotJu^r tuoeess with a spt^cific i>ri<irity Tlu^ 
receiving in^ocess gets the message and perfonns the action 
unplied by the message's identity. Depending on flags set 
within tlie message, a response mc^ssage may be posted 
back Ici the orighiattjror the message may be released back 
to the t>pc^rating system for reuse. 

The reception of messages can be gatefl by a priority or lim- 
ited by a timeout or both. Messages can bv ptjsted to an indi- 
vidual i)rocess or broadcasi \n many processes. Tlie posting 
of a message can be defened (V^r a st»ecific time to tiro\ide 
for periodic actions. Iiiternipl service routines cau only pttst 
messages, so arrangetnents tnust be made to acquire Ibeir 
messages outside of inteiTupt execution. 

Table II shows the message struct ui^\ Messages include a 
token fleid, wiitcli gives the message an identity or specific 
meatling. For example, a command tiiodule process r(M|uesls 
raw ill] >ut data by posting H) an 1/M module process ihe 
RE CV REQUEST message (a tnessage with its token set to 
RECV_REQUEST). A response field uidicates which process is 
t(.i be iiosted the result of the message. For example, when 
jjrocessing the RECV_REQLfEST message, I be ]/(> module pro- 
cess will iiost a response back fo iheiaocess nieitfionefl in 
the response field. A fiata pointer field, a size field, and a 
recover fiekl associate a bloc k of mcmioiy with a mc^ssage. 
The tecover field indicates whit h process is to be ut^tific^d to 
iTCover I be memory bhnk when it is iiu long*»i' necdetl. The 
use of associated data in this nuumer ^dlows tJie firmware to 
pass data blocks from jirocess to process and kU the final 
prcx!ess recover the dala propiMly. 
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Message 
Field 

Tnkeii 



Seiifier 


32 h\tE 


Response 


i}2 bits 


Data 


32 bits 


Poinler 





Data Size 


:32bits 


liecov^r 


32bii-s 


Flag 


8 bits 



Misc 1 
Misc 2 



Table II 
Message Structufe 

Size Description 

16 bits Message identity. For example, 

RECV ^REQUEST UKlicaU's tliis im^ssage 
is a requesl lo ri'teive dala. 

Sending process's irientity. 

Identity ot^flie piocesw lo receive Ihe 
response to the iiie!>sage. 

Pointer to an assoriated data block. 
F{ir exanH>le, iliis could ijoint to a 
block otinpuf tiala [urn RECV 
nies^iage. 

Number of bytes f>r data associated 
with [|ic n message. 

Idciifily of the proces?; to recover the 
asstjciated data. 

Indicates whether the message must 
be respf>iifled to ur data must be 
recovered. rfarespt>nse message, 
indicates if faihir e^ OK, or an 
unknown message type. 

32 bits Mcssiige-specific infonnation. 

32 1) i i s Message-sp ec ifir info ri na r I on. 



Semaphores. Semapliores pro\ide a mechanism to restrict 
access to u shartxl resource (often global variai)les) tt^ one 
process at a timt^. They are amilogous to a lr>ck on a doon 
Semaphores can he tust;inliijted, locked, ^md unlocked. 
Therc^ are only a few crilical uses of semaphores in the 
system. One is for tlie exclusive use of global eonfiguratloii 
data. Another is for the exclusive rise of tht* f^eneral-puiyxise 
meiiKiry pool 

Delayed Procedure Calls, hidividnal hmctions can be exeeutetj 
ai a later tinix^ \1a the otjerating syslein. Tliv f>perating sys- 
tem niaimaijis a hst of functions to be executed mid at tlie 
appropriate tune will execute the functions at a low inter- 
rupt level Processes can take advantage of tiris feature 
to exetitte critical code at a higher-priority iulernipt level, 
Interrujit sei*\1<e routines can take advantage of lliis featme 
to execute noneriticai code at a lower interinit>l level Since 
a list of functions is maintauied by the operating system, 
delayed proeefluie calls can hv canceled. Tlie user interface 
module uses deferred procedure call.*^ in inipleitteut key 
de^joiuicing. The defencd jjosl feature of message posliiiH 
is implemented by using ileferred procedure calls. 

Interrupt Service RQutines, Interrupt routiners are statically 
installed. In practice, interrupt routines often just post a 
message to wake up a jDrocess. For more sojjhisticated needs^ 
imernipt routines vm\ logically suspend imtil a subsequent 
interrupL This facilitates designing serial and sequential 
iniernipl sfate machines. 

Memory Management. Metnoiy management is stiictiy static 
witli few- exceptions. The operating system does not provide 
any sor1 of fimctionalily to allocate or frei' memon^^. The 



reliabihly of the system w'as gieatly enhajtced t^y designing 
il for static meinorj' ust^ The 1/0 module does provide 
for the use of its output ring buffer for general -purpose, 
restricted memory allocation with functioti calls such as 
Ring_Request[) and Ring_ftecover[). The restrlcUons vrcre im- 
posed for sinrplicity and because of the ring nature of the 
buffer: mt^umiy nmst be allocafed in mnltit>les of 1 bytes, 
memory must Ix^ held for a veiy shoil time or the efficiency 
of the outpul buffer will degrade, and although menioi^' can 
be returncfl j>iecemeal, tiu^ pitn es must start on 4-byte 
fjoiuidaries and be multi|)les of 4 b^tes. 

Firms (Soft Constants). Ajirfft Is a concept add(*d to the finn- 
ware desigu h> Jacilitale adjtistiiig constsmts postrelease. 
( onstants that may \we(l adJuslmenS after Hk^ printer has 
been relea.sed for inaiuifacluie are grouped together in hsts. 
Access to these constants is via tlie FirmO hmction caU. RrmO 
is called with a hst and a constant identifier. Firmd looks up 
and 3ettims the desired constaiit. Rrm() also <|uickly scans a 
small const airt replacement list. Tins replac^enient hst iu- 
t ludes I he original list mul constant identifier alojig with a 
new value foi' the constant. If a replacement exists, FirmO 
returns the replacement. Tlie constant replacenient hst is 
sttjred in nonvolatile meuirny. (leneraHy tliis would occur 
^is a final stej) in the manufacturing process. 

I/O Module 

Fig. 3 s iio ws how t he l/( ) m o d u ] e i s stn i cm r e< li 1 1 1 o p [ i y ^ i c al , 
link, and ai>i)lication layers. The physical layer deals with 
the signaling on tlu^ parallel cal^le. The hnk layer deals with 
logically dividing a single cabk^ into multiple logical chaimels. 
The ai>pIication layer deals with the various data, conunatidj 
status, and pacing aijplicalions uecessaiy to implement the 
printer features. 

Physical Layer — IEEE T284 Parallef Port. T!ie conned or on the 
Ikack of the 111^ DeskJet 82(K' |jj inter connects to thepaiallel 
printer pon on the PC. The IEEE 12S4 bidirectional parallel 
port speci Heat ion is supported by dedicated hardware and 
finnware. Marrhvare pei forms the btisic ciata transfer of bytes 
from the PC directly into KAM Finnware supiioits the IEEE 
1284 overhead retimred to {nit tlie pott in the jiroper transfer 
[nodes and to transfer data back to t!ie PC. 

IEEE 1284 redetnies the traditiontil parallel port lines BUSY, 
N FAULT. PERR. and so on to pennh faster data transmission 
and to iillow tiata In be sent back to the PC from the printer. 
Faster data rates are achieved by hiiving the host only 
pulsing the NSTROBE line until tlie printer rmses its BUSY line. 



Appticaiion 
Layef 



Link Layer 



Physical 




Fig, 3. Lnyr'TFil ]/(. J structure. 
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s - Stan of Paclt«i 
Fig. 4. VUnk packet lrav«^ling oiMhe physical cable. 

Tniciit iotially the NSTBOBE linr liad to be held down for a set 
minimum time period (which was a relam^ely long limei. 

The IEEE i2M o\erIiead for mode switching Is iniplemenled 
as a separate firmware process bi tlie system. To acliie^ e the 
IEEE 1284-reqniretI ;i5-ms response time, Uie procress runs at 
iiw higliesl ixriority in ilie sj'Sten^ Tlie process monitors the 
]jai'aJlel [iort lines and responds to changes by maneuvering 
til rough a constiyit state table. Tliis stale table includes in- 
ronnafion tm whal to watch for on the parallel lim\s. liow to 
respond on the paniOel lines, how to j^et iuu\ retrieve data 
at I he appropriate times* and which states can be exijecied 
next. 

Link Layer — VLirk. Tiie link layer pro\ides a simple logical 
chcQinel i>rotoc-ol. To prevent Uie printers input buffer frt.»m 
conii>letely fiUing up and preventing conummicatioii wilh 
the PC. image data and conm^ajid data aie separated into 
nvn higicnl channels. Each of these twf) lo^cai channels is 
nuh\idually paced to prevent one from blocking the oiher. 
To separate the data and conuntmds into two logical chan- 
nels, the raw b>1es are i>acketii;ed so tlun a channel nutnl>er 
cati be assigned to each piickel. 

A new IIP-proprietai>^ link-level protocol, \T»ink, replaces 
the more sophisticated MLC protocol usefl in the other 
DeskJet anti LiiserJel models. X'Link r^xiuires c onsirierably 
less code, ctm be substantially iinpleniented in haul ware, 
aiid doesn't rtHiinre a tiidireciioniil link. 

Fig. 4 shows the VT-ink packets. To packet ize the data, VUnk 
adds ft) ur adrliiional header bytes to each block of data. 
First, a siaii-of-|iacket character S, isst^nt. Si*coad, oni- hyle 
specifying tlu* channel rmniber is sent. Tliird, a word is sent 
hidicating the number of data bytes to foUow. A packet can 
contain up to t34I< bytt*s f if data. Custom I/O hardware strips 
off I he four header li.\1^es, uses \hv channel luuiitier to select 
a ring InilTer in RAM in whiih lo store du* data, ami subse- 
t|La'ntly trimlei's the ilala into the ring bid'fei' l>y DMA. 

Tatile 111 shows how channels are allocated in the DeskJet 
S20C\ Incoming packets arrive ftjr either cluiimel or chan- 
nel 1. rharmel t) is used for image data. Channel 1 is useii for 
conunands. ()uig<>ing packets ;ii'c transmitted iLsing chajmels 
1, 2, and 1-iH. Outgoing chaunel 1 is ii.sed for reS[»onses to 
connnantis. < )titgf>inM thantiel 2 is used for the periodic autfJ- 
status information. Outgoing chaimel 128 is used to supply 
pacing information to the host PC. 

Ring Buffers. Tv^<j ring buffers store the two incoming data 
slrcMuis ht sni the host PC. t)ne stores the image data arriving 
on VLink chiumel d. The other stores <Ttmmands arriving on 
VLink c I tan n el L The ring buffet^s are implemented with a 
combijiation of tiisfom luirdware aiKi firm ware. 



Table 111 

VLink Channel Uses 




Input 

Use Cltannel 


Output 
Channel 


hnage Data 




Commands and Resjwnses 1 


I 


Periodic Atitosiatus 


2 


Periodic Ring Buffer Pacing 


128 



Fig. 5 shoH^ a diagram of a single ring buJTer The custom 
ASIC selects tlte ring buffer in wlrirh to deposit incoming 
data based upon the chiumel number in the \Tink header 
Incoming ilata b>1es cire placetl into the l)>1e i>oinHMl to by 
the rings till register, and tlie fill register is uicrememed. If 
the fdl register passes the liigh wrap n?gister. die fill register 
is set eciual to the tow wTap register. Once the fill register 
ef^iials die recover register, no more uiput is pennitted. /\ny 
fun her input for this ring buffer will cause tiie jianiJIel ports 
BUSV hue to be set high and remain high until the recover 
regLster is clianged. 

For the command ring buffer, tJie grant register (a ilnn ware- 
only register not in the custom ASIC) is used to n^ark the 
data that has l»een granted to the parser. When the com- 
niaml associated with the data has completed executing, 
its d^ita is recovered by atlvancing the recover register. Tills 
t>errnits fiinher data input. 

For die image ring buffer, tlie ASIC ad\aiices the reiov^er 
register as it pults data out for the print cartiidges. This 
occurs while du* print sw(h»|i is taking j)hice. Tliis perniKs 
funhfM itipui lo occur tjn the image ihannel in iiLse the* V>ulfcr 
was jjrevdously fall. The grant register does not exist for the 
image IjulTer. 

A Qvh-d rhig buffer, <jne that is implemented entirely iu llrm- 
ware antl has no custom A81C registers. Is used for ait output 
buffer and occasionally genenil-j>ur|)4>se inemoiy allocation. 
The same ring buffer design is used, ihen^by reusing the ring 
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buffer utility functions. Tn this case, memory is granted to a 
process, advaiKing Ihc gnuit register. Eventiiaily tiic memory 
will be recovered, advancing the recover pointen For this 
output buffer, the input register has no use. 

All enhaiiceuient useful for both the command and general- 
puipose output ring buffers i.s t\u^ abUily to ret over blocks 
of memnjy out of order This is fricilitaierl by managing stih- 
rerovered bloclis of memory t>et ween the grant pointer and 
the recover pointer. With the rule that all memoty requests 
aJid recoveries must be restricted to nniltipleH of 4 tiyies, 
sutjret overed IHrH/ks can lie implemented using oitly die 
RAM contamed wiihin the recovered blocks themselves. 

Output. Ft>r output, \\w main control I/O jj roc ess receives 
SEND messages from tlie <it her processes wilhin the system. 
Like input, output is formatted as VLink packets. Three 
VLiiik diannels iire used: ch;iimel 1 to transmit command 
respoases back to the host, channel 2 to trtmsmit periodic 
autoslalus back to the host, antl channel J 28 to transmit I/O 
buffer pacing information biick to die host. 

The design handles cases when bitiireciional UO is not avail- 
able. This can happen when the printer driver is busy and 
not coomiunicating with the |iarallel t>or1. wtien |}ic driver 
is not rriMning at tdl, when an external device iising non- 
IE EE-1284-<^om|>li ant cables is between the pririter aufl the 
host, when the VC does not suppoit IEEE 1284. or when 
there exist niiscedaneous hardware mid softwme conflicts 
with the parallel port . 

In cases where lii directional I/O is not avaiIaV>le. output is 
prevented IrcHn accumulating inside the |n iruer^ t>y buffering 
at most one packet pei VXink channel Ajiy i>rev ious iiackets 
are aut oniatitrally recovered back into tlie system ainl uevi^r 
transmitted. This pritnity scheme ensures that the [lost. PC 
always receives the latest status. TIic only repercussion for 
bitUrectional systems is tliat the rhiver caimot send nmltitde 
queries to the printer wittioii! wailing for each individual 
re^Tiponse, 

Image Data. A key and early concejit of PPA is that data aniv- 
ing at the printer will alreafly be formatted for the custom 
ASIC hardware controlling I fie jnint cartridges. In other 
words, the finiiware mid m ic re j processor in the printer do 
not process the data, nor rlo they itiove I tie data in RAM. Tlie 
image data is transferred l>y D^LK into the image ring bitffcr 
from the ASK; I/O block tmd from the ring buffer to tlie print 
control ASIC blocks, 

Autostatus. To keeti t!ie fiost PC informed of the status of the 
lyrinier. an autostatus process periodically formats a data 
block with ihe printer's current status. This data block is 
tlien given to the I/O module for transmission back to the 
host on VLink channel 2. 

1/0 Pacing, When one of tlie input ring buffers Gils up eonv 
plelely and another byte arrives for (his full ring buffer, the 
overilowing bvne causes the |jaialtel poii s BUSY line to raise 
and hold off the host PC from tiansmitt.ijig any further data. 
Such a sit viati^Hi could prevent the host VV from tiuerying 
the printer's stains or canceling a print Job, so the priiner 
and host work together lo prevent either of the input ring 
buffers from completely fiUing up, thus allowing the other 
ring buffer to conUiiue to receive data. 



Tlie printer tnmsniJts back to the host periodic ring buffer 
status information on VlJnk cliaimel 128. The (iata trans- 
mitted indicates both the instantaneous free space available 
in each buffer and the anion nf of data recovered from the 
ring buffei-s. The amnnnt of data recovered from the ring 
buffers is cumulative. In other words. th(^ printer repurts the 
total number of l>>tes it has recoveied honi all r>f the input 
liufTers. 

This total number of recovered bytes permits the host PC to 
di^t(*miinc exactly how much sparse Ls available at any lime 
in the priiner's iiniul buffers, as long as it keeps track of how 
jiiany bytes it has itself transmitted. Tliis mechanism is re- 
quired liecausi^ dieprinter*s repoil of tJie free sj)ace available 
in tlte in[>ul buffer is only an instantaneous reading, h doesn't 
account for m\y data in transition and could Ihns give the 
host PC a false reading. 

Command Module 

The con inland module is responsible for parsing and execut- 
ing SCT* (Sleek CJomniand Prf>tocol) lommands. SCP pro\ides 
the command protocol for commimication between a PPA 
printer and its host driver. SCP is a binajy^ language (as op- 
l)t>sed In the ASC^Il formatting of the traditionid PCL com- 
mand language^}. Tlit^ general command syntax is shown in 
Table IV. Some SCP coiimiands me shown in Table V. 







Tslile IV 




SCP Command Structure 


Command 






Field 


Field Size 


Description 


ID 


2 bytes 


Identifies the cranniand 


Reference 


2 bytes 


Reference nuinl>er nse<I to citucel 
conimaruls 


Priority 


I byte 


Order in which the command is 
proc:essetl 


Pad 


1 byte 


Unused 


Length 


2 bytes 


Number of additional data bytes 


Data 


{) f.O n 


Depending on command, typically 




bytes 


contains a number of subfields 



Command Farsiog. Th»^ Parser jirocess requests niw date! b>les 
fruni tiie I/O in^Hlule by seufhnga message. ConuiiajKl bound- 
aiies are identified, blocked, and attached to an acquired 
message. Eac h SCP conmimitl is attached to one message. 
This message idenlint^s and leads the command through tlie 
system for exet iitioiL The individual messages are at first 
posted TO the Pacer process. 

Command Pacing. The Pacer process receives messages point- 
ing to raw SCP command b>tes and sorts them accorcUng to 
priorit>'. It continuously selectij the hjghest-priorit>' conuiimid 
and posts that command to Ihe appropriate modiUe for exe- 
cution (which crnild he 1/0. ntechanism, etc.) The Pacer then 
waits for die commanil to complete by wailing for a restionse 
message from the selected exeeuton 

Commands are sent to ihe Pacer not only by the Parser but 
also Ijy (jilier modules that may want n command executed. 
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For exiiniple, when the printer door is opened, a HANDLE. 
PfilNT_CAFlTRiDGE; Change_PFin!_Cartrtdge romitiaiKl Is given lo 
the Pacer for exeentian. This romniand m issued by tl\e l/l 
module. 



Table V 
Examples of SCP Commands 

Description 



Command 

PRiNTJWEEf 



C onfigure hardv%"are lo priiii 
a sweep of data 

Ijoad aiid eject 

Print cartridge change, wipe, 
spit, etc. 

C0NFIGURE_PRINT_CARTRIOGE Print cartridge temperatures 



HANOLE^MEOIA 

HANDLE PRINT CARTRIEIGE 



STATUS„REaUEST/REPDRT 
CONFIGURE_AIJTOSTATUS 

CANCEL_COMMAND/DATA 

RESTART 

ECHO_DATA. PEJ^FnRM_TEST 
SET_ALlGNMENTJNFORMA- 
TION 

ULSTATE. UI.MOMITOR 

ATOMfC COMMAND 



Synchronous status 
information 

Asynchronous status 
information 

Flush a command or image 
data 

Fiehoof printer 

M i see 1 1 an e nt J s f u nc tions 



User interface set and read 

Low-level m anulac t u ri ng ^md 
test command 



Command Exectitlon. A fliird ronimcind module process, the 
Executor, rxerii(o^> SCP cnnnnaiuls dcsipiattHi for tlit> Parser, 
(ieuerally, SCP ("(»miiiajids ttre (lelegal(*(i lo theii lesiiective 
tiKit lilies lor execution. A few conunandsT such ;is the 
CANCEL^ COMMAND command, are exet^uteri by the commatKl 
mothtle itself 

Mechanism Module 

The mechanism mothilc executes riu- niecli^inisnt-related 
SCP conunands, maintains I he* yyslent'^5 nu'chajitt^al stale. 



handles periodic print cartridge semdng needs, handles all 
motor needs and ftmctions, and prints sweeps of data. 

Tlie ptechanism module consists of two processes and se%'- 
eral interrupt service routines. The iop4evel prm^ess. tlie 
Mechanism Stale Machine, manages tiie liigli-lexei mechanistn 
stale (cover optni. paper loaded, etc.). The tow4evel process, 
the Walker/Dispatcher, manages the execution of mechanism 
motion scripts called .//ow;,s. 

Mechanism State Mechine. The Mecttanisni St^te Machrne is a pro- 
cess rliat luauiriiuis the ctirreni mechanical state of the sys- 
lem. takes the proper actions when state changes occur, and 
rettmts to previous stales iiTter as\i\chronoiLs state changes 
occtu-. Fig. 6 shows a sntall portion of the mechanism state 
machine to give an example of its hierarchical nature. 

Tlie ntechanism starts in the Entry state and after inilializa- 
lion proceeds to an Idle state. As a prin? job cotiies in* a 
HANDLE MEDIA: to a d_Paper command causes an eoir> into the 
Load statCt iHul w hen paper is loaded, to a Ready to Print .'istate. 
During these states and changes, mechanism flows (or 
scripts) are performed and the state machine responds to 
asyncJironoits events such as a prim cartridge change or 
pajicr jaiti. \Mien responding to asynchronous events, an 
asynchronous state change is made and the appropriate 
flows aie performed. The state then reverts back to the slatt^ 
that existed w^hen the asynchronous event occmTccf 

Mei:hamsm Flows. MeibnuLsin flows are small lists of individ- 
ual tneihanism it^inicliotKs. t>iji tally motor moves, to com- 
plete a higli-1 eve 1 meclianical lask. For instance, when start- 
ing the Load Paper stale, a loatl paper flow is executed. This 
flow contains a list of individual motor move commattds to 
accomplish the nnilti motor task of loading paper 

Flows ^u'e written in a ctistfint scripting iitn^uage. The reascMi 
for the custom scripting language is to piTmir develfjpment 
of motor motkin witlujut ret ompiling imtl buikling the finn- 
wfu-e set. A fltiw can hv titjwnloaded lo the tirinttT mid exc- 
cutefl, permitting an easy statidalotie mecbatiism develoi> 
men! en\ ironment. This technique is also used during 
manuiacnuing to invoke custom manufacturing nuMor 
movemeitts. Wlien a particular meclianisnt How has been 
developed, the flow am be incoi-fioraied into the firm ware 







Any Stare 






Fig, ik A iHH'fton ipfthi' tnerha- 
nuHUi module's tiienu-diit^al siMe 

imirrliinp. 
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set and executpff jusl as during tk^T 1 up nioiit. Table VI lists a 
few of til e avii \\nh\v 11 n w t ' { ) 1 1 1 niaiids. 



Table VI 
Partial Ust of Flow Scripting Comifiands 

Flow Opcode Parameters 



Carnage Motor Move 

Paper Motor Move 

Wait Carriage Motor Done 

Wait Paper Motor Done 

Jump to Sub Fiow 

Goto Flow 

Fan 

Relative Branch 

Exit Row 



Speed, post! inn 
Speed J distance 



Flow ID 

Flo\^ ID 

Oil/off 

Condition, brmicli distance 



Walker/Dispatcher. 1 he tlow ,sc"iipl L^xeciitor is a process called 
tiie Walker/Dispatcher. This process receives messages witii 
either' nddresses of flows to execute or addresses of comple- 
tion routines to execute. When given an address of a flow to 
execute, tlie Walker/Dispatcher looks up each opcode and calls 
Hie corresjiondiit^ riinction to periorm thc^ ojjcode. ^"hen 
given a completion routine to execute, the Wafker/Dispatcher 
executes the completion nniliiieantl then retries any oiX'^»<:i*^ 
that had to wait tor a eon ni let ion he lore continuing. Tliis is 
shuilar to a microprocessor retrying an Instmction after a 
page fault is coiTected. 

Actors and Gaffers. The functions Uiat implement the flow 
opcodes have heen nit knanierl miors. Tliere is one actor 
function loi- each flow oik ode. A function table is used to 
select which actor function to execute for each flow^ opcode 
encountered. 

An actor funetjon jjarses tlie flow opcodes pEuameters, 
verities that the particular mechanism resoince isn't in use 
[generally a motor), and makes the apt)roi.)riate ciill to the 
motor control code to stail tlie jj roper niottn niovenicMit. 
If a resom'ce is m use preventing the actor from con I inning 
execution, execution of tlie actor teniunates and is retried 
when a resource becomes free. 

A completion routine is pa?tsed to I Ire motor control code to 
be executeti when the motor ha.s completed motion. These 
completion routines have l^een nicknanie*! fjuffvrs. They 
deal w ith errors dming motion, do any fmal cleajuii>t and 
cause the script executor to retry an actor ttmction that 
couldrx't execute because of a resource limitation, (jaflers 
aren^t executed by the motor contiol code, biit ratiiei are 
posted to the WBikef/Dispaicher for execut iojL 

IWIotor ControL Motor cont njl is art omplished \ia a contbina- 
tion ot process and interrupt threads of execuiloru Generally 
the execution that occurs m process space would include all 
mitial motion and interrupt setup calculations. A transition 
is made to the intermpt space of a sek^cted hardware hiter- 
mpt with a call to lnterrupt_Comext(), Once in inteniif)t space, 
etdls can be made to Wait_FDr_Jnierrupt[) to effectively suspend 
the execution unti] the associated intei*rupl occurs again, 
Fl^xecution continues, including imy adtliiitjual suspensions 
for additional interrupts, until time to iJtrorni I he Walker/ 
Dispatcher flow executor ot com [3 let ion. A message is postt^l 
ro the Walker/ Dispatcher with the address ot the appropriate 
completion routme, the gaffer. 



.An example of a mcjtt>r control function using such a combi- 
natJon \s CM^Move_Ancl_HolcH), which moves the carriage mf>tor 
to a specific location, holds there, and posts the given com- 
f)letion roiilljm. CM_Move_And_Hold() is called wiUi a motion 
acceleration profile, a final i)osilion, a \i}^^' other motor ad- 
justments, cUid a pointer to a completion message. The func- 
tion does some preprocessing to account for pre\ioiLs motor 
motion errors, lo calculate I he direction and distance to 
tt*avel, and to sek'ff acceleralion mid slew pmametcrs. The 
tiTinsilion to interrupt space is marie, Tlie function then go(^s 
througli three loops: one for acceleratmg, one for slew^ing, 
ajid one for dfX'elerating. Each loop eaOs Wait_For_lnterrLjptO 
and sets up the next ineiemcnlul motion request. I^mally. at 
\\\v comjjielion of the motion, the flmction posts the comple- 
tion message to lite Watker/Dispatche: process. 

Conrigu ration RAM 

The HP DeskJet 82 Of printer has a block of nonvolatile 
RAM ti\aT is tised for coriHgm ing the printer in ways that 
nmst survive shutdowns. A C -language structure is used to 
oiganize this eonnguratimi data, 'IVo small processes read 
and write the data fnun the jion\ola!ile RAM and control 
when this nmst be done. Kxuinples otllelds stored hi non- 
volatile RAM aie showi^ in Table Vll. 



Table VII 
Partial List ef Configuration RAM Contents 

Configuration Field Description 

Startup tests List of startup tests to perform 

I *ri n t. Call ri 1 1 ge S I <.n e r I pr i n I c tut ri d ge c a 1 i bra t io n 

Calibration flguies 

Page Count Count of how many pages have been 

printed 

Finn Replace- Set of constant retilacements 

ments 



Alignment 



Dual print cartridge alignment 
arljustinent factors 



Mechanism State Indication of whetlier tlie met lia- 
nism was |iroperly stored l>eJbre 
shutdo%\ii 

A copy of the nonvolatile RAJVI is kejit hi iioiTnal RAM. This 
copy is made upon startup hy the Contig uration process. Any 
process can access the configuration data copy as long as its 
access is protected by locking a semaphore. After a process 
has made any change, it must send a SAVE_C0NFIGURAT]ON 
message to the Configuration process. Configuration schedules 
the noru^olatile liAM update by sending a message to the 
NV RAM Process. 

A second process actually reads and writes the nonvolatile 
RAM. This is to a\ old holding up tlie system, since the physi- 
cal reading and \\ riling of nonvolatile RMT takes time. Tlie 
NV RAM process executes at a ver>' low priority so that non- 
\ olatilc RAM is only updated when tliere is nothing else to 
tlo in the system. 
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Power-On/Shutdown SequenciRg 

A liriK'i^s kiio\\"n as Pstate is ils*h1 to raciliiaie a romrolNl 
st;iniip aiid shiitdowTi jjnx'ediirp This is iiniM>rtajii lo eiLsiire 
that (Jependendes are haiicUrd during startup atid shutdown. 
To arc^oiuplLsh this, a phascHi .startup f>r shmdo\%ii is iis^^d. 
During st;tnu|> phase 1, procc'sses eaimot assiinic that any 
olhcr prucetssi has had a chfuice to execute any code. Each 
process iniliaJiaies only lis own iniental data structures, 
Duniig startup phase % processes can assiune that all of the 
other processes have completed their phase 1 code, TTiere is 
no liard and fast nile gfj venting what is to Ije done at each 
phase. It is simply kno\\Ti that within a given jihase. a jjro- 
cess can assume that all titfier processes have completed all 
pre\ious phases. Similar procedtires are used iii shtitdowii. 

The sfaii up sequence proeee<te as follows. At startup, Psiaie 
hroiidcasts to each pro<^ess fl*\sirjng staatup iofonuation the 
START message. Prw^f^ssps indicate they wani die stimnt) in- 
formation l)y Ijelonghig to the Startup class. Included within 
the START message is a phase nmnber. The first time START is 
liroadcast, the phcLse field b set to 1. Once each pi'occss hiis 
resjjonded to this first START message, another START nu^ssage 
is t>roa(kasf, tltis time wilh tJie pliase field set to 2. mai agmn. 
eacii piocess ^'^ill resi:iond. Fintilly, once all t^hi^ses lia\ e \n^en 
completed. Pstate broadcasts the message START„S£QU£NCE_ 
DONE. At this point, all processes can assume iliai the system 
is operationaL 

Internal Test Print 

A. small prt>cess is used to perform the internal tesi iinni 
feaiure. The Test Print process waits until il is lianded the 
DO_TEST message, h then tcmtJoraiily disahles I/O and lakes 
tjver the image input hutler. Illlingi! up wiUi test i>nnl data. 
To phnU this process builds its owji HANOL£„MEDIA: Load, 
PRINT.SWEEP. and HANDLE MEDIA: Efect coouiuinds and sends 
them to the command Pacer rorexe< ulitm. Finally I/O bidfers 
are restored and VO reenaljled. 

User Interface 

The user inlerfacc moilule, 1 71, is dcsij^nc^l lo resijoild to 
Stimulus of vmitnis e\ enis ha|>|)cning in die system. A slate 
taljle is used to map a sthnuhis to a particular acticnt and 
suhsetiuent stale. Each s!ate also inchides a set *jT exii con- 
dirions. '^Tlu' process's nmin function is hi n'sptnifl in LJI EVEMT 
messages which aie posted when front-panel < lumges occur. 

The primer co\iT door and huiitjus generate Lnlemipis when 
I hey clumge. Each of these luLsan Inien'npt sejTi< e rcmtine 
that lakes care ofdebcjimcing, using deferred procedure 
calls, and posts a message to the Ul process indicaljog t!u^ 
event chajige. Tlie Ul t>ificess ihen marches throuj^li its state 
table to make die uitenial change to \liv printei- and the \isual 
change Id file user. 

Printer Status 

Primer stattis is managed by the status modide. This mothile 
receives update indications from the rest of the system, 
composes sjjecific status responses hark to the host P(\ and 
composes pt^riotlir aulosiaias responses hack to the host PC, 

Autosiatus. Table VI II shows a sanitde of I In- unlostatiis dala. 
Amosiaius is a fixed structure <if Itirs mtd nuu^'ric fields that 





Table VIII 




Examples of Autostatus Fields 


Field 


Description 


Ahsluad 


Paper load failed^mfJSt hkely out 




of paper 


Door Open 


CG\'er df K>r open 


Media Jam 


Paper jam delectetl 



Print Cartridge Diial print cartridges not property 
I'naligned aligned 

Last Error Code Lpast error encountered by the 
fimtware 

is !ransntitle<l back to the host on a periodic basis. The Auto- 
status process is responsilile for building the transmitted 
data block aiul bantling il oxer to the I/O module for trans- 
mission to the host. 

Status Update. UPDATE_fTEM messages are posted to the status 
mocluk' lo update st^ecillc fields in tlie autostatus block. At 
this time adthtional notification to the rest of the system can 
be made l>y the status module. For instance, the stattis mod- 
ule \\'ill lUJiify the L7I module of paper misloads, co\er door 
openmgs, missing print cartridges, and so on- 

Status Responses, Tbf^ host PC can alsti request specific in- 
IVjrmation Irom I lie prinien The Status Request process receives 
status re<|uest <"ommands trom the Command Pacer mfidule* 
formats the residi, and again hairds the data over to the I/O 
modide for transmission to the host. 

The Simple Process 

Tliere are small fimctiotts or coniinaiids that must be exe- 
cuted Ihat <lort't regally fit logically ititt* any tif the modnles in 
the sysUMu. Logically for modnlarily reasons, they inigiit 
each Tonri liieir own niodule or process. But in an effort to 
consene ROM and HAM, lliese fimctionsbave been com- 
bined into a single process. Tal)ie IX shows a partial listing 
of the fimctions handled by die Simple process. 



Table IX 
Examples of Simple Process Functions and Commands 



Function 
SCPCmdNV RESET 

SCPCmd SET ALIGN tNFD 

SCPCmd SET PAGE COUNT 
SCR Cmd REPLACE FIRM 
Call bration PiinctionaUty 



Description 

iic 1 [ li I i al I zes non vol ati 1 e RAM 

to default values 

Si ores the print cartridge 

alignaneiit information 

re<'eived from ihe host 

Stores a new \'altie for the 

page counter 

Stores a new value for the 

s|)ecifted fuTn 

[*(^rlbrms periodic calibration 

hmciinns 
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Flash Memory Support 

To provide for iiriiiwaie upgrades during tlevclapjiicm ajici 
the early stages of oianufac'turiiig, flasli memoiy is tempo- 
rajily suiisril uted fur ROM Tfie finish Tiicmoo^ can be repro- 
grajiiiiipd whenever a new firmware set is available. 

The St P command laiigita^e provides a command, EXECUTE 
DATA, whicli causes Uie finnwaie to Jump K) data flowiiloaded 
into the intage l>uner. Before mtddng tiiis Jump, the priater 
shuts down all interrupts to guarantee that none of the exist- 
ing firmware is still exeeuiing. To rri>n^^nm^ (he flash mem- 
ory the dowiiloaded program roiifaiiis IhAU die code lo le- 
program the flash memoo' and the (!aia H> program inlo the 
Ha^sh memoi3^. Wlwn tliLs do\Milf>ade<:l pRjgrmn has eompletetl 
reprogramming the flash memoiy; it executes a 6800(1 reset 
histruction, effectively retuming control fiat k to ffie Hash 
meniory and beginning execution of the tiewly installed 
firmware. 



A ck n o wl edgme n t s 

S|)eclaJ ackiU)\vledgnienl,s are in order for the IIP DeskJet 
82flC firmware team. David Neff anti Eric Ahlvui managerl 
the team ai different times. (Jene Wehjoni started on the 
archil etl are early and hatl a key role in leading the concefil 
of using message comnmnication processes and designing 
the operathig system ijiterface. Mark Garboden led the de- 
sign and implementation of I he Walker/Dlsparcher iJong with 
its associaled flow scripts Ui\ moiion control. John Van 
Box! el designed anci impUMuenled the r)p era ting system an(i 
the low-level motor control cocie. Rose Elley had the critical 
role of providing engineering support and tools to enable 
RtSc f ) . tni si o n 1 e r assi i ran ee . an d m a n u fa c I ur ing t o move 
tosvards the new P1*A artrlhlectnre. iMehnda (Jrant did user 
interface desigti. Carl 1'bojnpsen, Bob Callaway, and Hugh 
Rice did significant design ^ consulting, and coding in the 
areas of ])rin1 cartridge, motor, kuid analog fmic^tions. 



Conclusion 

The HP DeskJet 82 OC printer fini^W' are arclii lecture success- 
fully met or exceeded all cost, quality, sciiedule, iind Ihrough- 
put goals, Tfiis is particulaiTy satisfying considering Ihe firm- 
ware plalfonn stiuled completely from scratch, with flesign 
leverage only in the mechanism flow^ scripting arena 
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PPA Printer Controller ASIC 
Development 

As the first Printing Performance Architecture printer, the HP DeskJet 
820C needed a completelv new digital controller ASIC design. The chip s 
architecture was optimized for the specific requirements of PPA. 
Concurrent development of hardware and firmware through the use of 
hardware emulators and attention to regulatory issues during the design 
helped the product meet all of its requirements on schedule. 

by John L* McWilliams, Leann M, MaeMillan, BitttaJ PatlialcT and Harlan A. Talley 



Tlie Printing Performance Archiiectiire (PPA) used in the 
IIP DeskJet 82{)C* printer is a signiiicant step foni-aici from 
iuiy j)rev!ous HP inkjet printer product in pro\1diivg tht* 
consumer with a higli -performance product at an excellent 
price point. Since PPA redistributes the printing tasks 
befwc*en the host and the printer a complete redesign of the 
digital controller ASR' in the printer was reqitireil. Tiiis 
retlesjgii cilort took into account the overall protiuct con- 
straints of cost and time to market as welJ as £tll applicable 
govern nient regulations. The result is a highly integratefl 
ASIC I hat iuiplement.s all digltiil fiinrtions perfonned t)y ilie 
HP Desk-Jet S20C on a single t'hiiK This high It-vel orintf-gra- 
tioti signifjcantJy decreased the i*ost of tlxe electi'onics in tiie 
HP DeskJet H2i)V compared to tJie previous-generation prod- 
uct wiiije maintaining the t)rinter"s performance. This ailicle 
dest riljes The .system t:onsiderations. engineering rlecision 
trade-offs, and development mt^thodokigies that played a 
role in tlie deveUypment of tlu* digital controller ASW tor the 
HP DeskJet 820C. 

The (tesign nf Itir ( oiili'ollcr ASIC had to btMioiic tinck^r 
numcTous const raiiits. As in any cnn^iimer-ori^niicd [jroduot, 
I he rnnMuos! rinisideraHon during design Wiis Ihr rinal cost 
to the buyet'. The Perfoniuuice Printer Arrliitecti ire. as *k^- 
scribed tn the article on luige (V was tlevek>ped \u rediae the 
total cost of the printer. PPA idk»\^-s several ojjlimizations in 
I he digiial act hi reel u re. In loday s (Tunpetitive environniejU, 
liuu* lo market is nearly as critical a constraint as cosh 
Meeting the tune-to-market eoiLstraint required concurrent 
developnieni of hjirdware and finnware and a biig-free ASIC 
at n**flif3t tekvLse. Tlu^st* net'ds were atklresst*d by vLsing luud- 
w a re t miu 1 1 ii i < >rs d ur i ng t le vc* 1 1 > p n i en t . Fl j i a ! 1 y , 1 1 1 v p i' i n b ^ r' 
had t(j meel ov excee(i all government regidatkais uk liahng 
those pertiiining to EM] aial ESD. Taking these needs into 
ac(*onnl during the ASIC dt^sigu helped the prodttct pass all 
reqiiiremeiUs on schedule. 

Digital Arrhitetlure 

li(74 itdU'ss of thtHr specific type, iill printeiis require several 
pieces of digiial liarciwanv These pknes include a tni<TOfir(^ 
eessorto contn*! the print er» KAM for data, UOU tor lit in- 
ware, and custom digitiil k>gic for i)rinler-speeille functions. 



By optimizing each of these piecx^s, signiricatil cost savings 
were realized in the digital ASIC. 

PPA significantly reduces the cost of the printer by o|i!iiuiilfy 
partitioning the piinting tasks befween the software ninniug 
on t he host raid the hardware and fimiware nuining in the 
printer. The paititioning is done without sacrifu tug the 
printer's performance. All tiisks tliat vim be done on the host 
computer without severely tiffecting api>lic"ailon (jerfonnauce 
are dtjue in the driver Tasks with re;il-tifae constraints ;ue 
performed by the hardware aiid firmw^are in the printer. 
Because the host performs the tn^jority of the data mtmip il- 
lation, data that is sent to the primer is in a format that 
is %a^ry close to the final form ust»(l to fin^ the tirintheaily. 
Because of this, tlit^ dighal architecture was rlesigned wiili a 
guiiihig principle of "the prwessor does not totich the dala^" 
Once this principle w^as adopted, the ASIC teaui was ahlt* iu 
make several itujjoitant design decisions. 

First, a relatively low^-power processor is all tliat is needed, 
since tlu* processor does riol niaiiipiilale the data. .After sur- 
veyi n g t h v nvm 1 a Id e m ie n ) [mn 'esst us , 1 1 1 e b >-M 1 1 z vt *T*si on 
of the MtJtorola tBECOOO was c^hosen as tire best, fit, St^cond, 
shice the rntmber of tasks tlie finnware performs is limiiecl 
the code si^e vm\ be keixt small entjugh that a RClM witti all 
rirniwan* can be integrated on the ASIC, eliminating the 
need for iu\ extemal finish memoiy or ROM Third, all data 
maniptilat ions iwed to be done in hardware, which limits 
those niatii|UJlations to being rel a lively simple. Finally, the 
uieniory requirements ju-e limited — a IM-bil DliAM isstiffi- 
eient for the data needs. The DJi\M holds all finnware vari- 
ables mid stacks as well as all iJritUing data. Even with tiie 
DRAM tloing double duty, the memory' baridwiiltli require- 
raents of the architecture are fairly low; aji<i tlie product is 
able to use a kiw-f^ostp lM-bi(, nilibk^-witJe DRAM, 

A lilock diagram of the HP Desk^Jet 820C"s digital architec- 
ture is sliown m Pig. h The digital electronics consists of 
tltree main components: the digital ASIC, a IM-bit DRAJVl, 
and an optional extenuil Hush menujry or ROM, The dighal 
ASIC eotisists tjf a OHECtKlb microjuoeessor, a BlK-byle 
ROM. arjfMHIO-gaiesiandurd cell block, and a IK-byte SfL4JVl 
used as a data eaclie. In addition to the external ntetnory 
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components, the digital ASIC is connected to the t/0 con- 
nector (IEEE 1284 J, the printer motor ASIC, the printhead 
ASIC, mid ail optical encoder which provides caiTiage 
position information. 

Ttie majority of ttie standard cell area is devoted to tlie data 
path, wtiicli is tiie path the data f(5llows as it moves from rhe 
I/O comieetoi". tlirough the DRAM and SRMI, and up to Oie 
pen ASIC. The remakmig logic is used for inteifacii^g to the 
microprocessor^ for controUiiig motors, and for keeping 
track of the cmTent carriage position, j\Il memories, includ- 
ing registers in the standartl cell block, aie nieinor>' mapped 
into the 68EC0OO's standard address space. 

Flajsh or ROM 

The ASIC is designed to be able to read code for the proces- 
sor from one of tliree sources: a flash memory de\ice. an 
external mask-programmable ROM (MROM). or die internal 
ROM The reason for the three sepaiate soiu-ces is to better 
meet time-to-market constraints. At Ibe beginning of the 
manufactming ramp, code was stored in tlasli jnemoi">. That 
way, final firmware did not need to be released until just 
before the start of the ramp. As soon as the firmware was 
stable, it w^as released to both the MROM vendor and the 
digital ASIC vend<:)r for prograj timing into the internal ROM, 
However, MROM lead times are nnich shorter than general- 
purpose ASIC lead times, so MROM pails were available 
much sooner than ASICs with properly programmed hitenial 
ROMs. Consequently printers were built with MROMs tor a 
period of time iniTil ASICs wnh final firmwaie were available 
(MROMs are about half the cost of flash parts). 



Prrn* 

Cartridge 

ASIC 



Fig. 1. HP DeskJet 8200 con- 
trolliir .ASIC lilock diagram. 

Motor Control 

The IIP DeskJet 820C has three motors: a do motor Tor mov- 
mg the carriage across the pai:jen a stepper motor for pickuig 
and advancing the paper, and a second stepper motor for 
controlling the pen setvice station. The stepper motore are 
contxoUed in an open-loop process by the firmware. The 
film ware controls a stepper motor move by writing appro- 
priate phase and pulse width data to registers in Ihe ASIC, 
Hardware then generates the appropriate signals for tlie 
motors. The phase and pulse witiib data determines tiie 
direction and speed of the moves. 

The carnage motor is controlled by a finnwaie-btised con- 
trol loop that monitors the carriage position and atyusls tJie 
motor control signals appropriately. The caniage position is 
determined through the use of an optical encoder. The opti- 
cal encoder consists of a light emitter-detector pair with a 
plastic encoder strip betw-een tliem. As the carriage moves 
across the i>ai>er, the light emitter-detector pair senses that 
it is moving along the |)lasric stiip. and sends some signals 
to the ASIC. The hai'dwaie in rhe .^IC takes this iiifoniiation 
and uses it to keep tiack of the ciuTent caniage position. 
Using the carriage position, the firmware tracks the car- 
riage's speed and acceleration and a4justs the motor energ>^ 
appropriately. 

PPA I/O Packet Format 

The data from the host conies to the printer in a simple 
packetized formal. As shown in Fig. 2, the packets are made 
up of two pieces: header infontiation and data. The header 
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Header 



Data 




Ftm Byte 



Usi Byte 



Fii^. 2. ['[*Adali3 packpl Ibima! 



ijifonnarion consists of a siaJi-of-paeket (SOP) l>yte. a chaii- 
no I bytt^ and a two-byte data-sixe field tliat reflects the 
mitiiber of byte.s m the data field (0 to 65K)- 

hi lh(? FTP DeskJet 82(K.'. packets from the host may contain 
iHic of two t>pcs of information: conunand data or iina|*e 
data. Tlie rliannel t^yte tlcltTniines which typc^ of data us 
rontaincd in ihe packet (hence, in the HP Dt^sk^Jet B2tX'. tJie 
chiutnel byte will be one of only two di.sthict vjUnes). Cimh 
numd data con till n.s PPA piinttn'runirol comnuinds. while 
image data contaiiis intonnation tiiat is to be printed on a 
jjage. The image data that is sent to the printer is in a form 
that resembles a l:)itmap of the ima^e, and therefore rtHjuires 
a ininimnni iinuRnn of rt^fonnatthvs hefon^ being used In ilri' 
Ihc iirirn heads. Tn niiinniizt^ fiie anuaml nldala lliat niusi \yv 
sen! over tlie P(J cahli% image thUii is oiitionally <*t>mprossed 
before being sent to the primer. 

Data Path 

A block diagram (\f the flata path is shown in Fig. -3. Data 
enters the ASIC dirough the 170 t abk\ fiardware depaekel- 
izes it jyid separates it into thc^ image ;md c*oinnum<j t iuinnt'^ls. 
Iniage data is transferrt*<l tu unv bntTer in the IJRAIVI mid 
command data to another. tK»ili liy DMA. t'ommajid data is 
consumed by the llrnrware w^ith no hardware interfcHTnce* 
hnage dnta is movefl by I he sfrrattf fiardware from the 



DRAM to the SRAAL During the move, the image data is 
decompressed if necessary. From the SRA^l data is mo'se^i 
to a shiA register from which it is serially shifted up to the 
carriage lx>ard and is used to fire the pens, 

leput/Output 

The slandard cell I/O block implements the low-level hard- 
ware I hat lakes in packets of information from tiie host \ia 
theiJamlltH jwirr- It toniiiins h^irdwan* sup jH>n for IEEE 128^t 
compatii>iiily mode;ai<l extended t-apability port (E('P) in 
the forward direction from the host to the printer The har<:i- 
ware also supports, witli firmware assist* rcverse-chajinet 
nibble mode for sending information back to the host com- 
puter. Ttie I/O block also contains hardware tliat stri|js the 
data stream of its packet tieader information, separates the 
packetjs into command and image data, and sends them to 
the I/O DMA bltK*k. From the lieader uifon nation, tlie hard- 
ware c^hecks the startH:>t-packel l>>1e to make sure it is the 
correct value, uses the cliaimel l>yle to seleci die ajJt>roi)riate 
DAL\ chamiel and uses the size field to detenimie wtien to 
expect a new^ packet. 

The M) DM\ block receives data via the I/O interface and 
stores it into either the command f>uffer or the image buffer 
in the DP AM These btiffers are (ii^ signed as geiierai-i)ui7)t>st^ 
circular l.uiffers that can reside anywhere in the DIMM 
memory space. The command buffer is emptied by the 
processor as it executes the commands* Image data is con- 
stuned l>y the senanl hardwaiv. As data rrotn tJie littffers is 
coT^sume<l. the host is n<Jtified, via Jhe firmware, of the avail- 
able l>nffer st>a<*e. and more data is sent down. Tliis archi- 
teclurt' allows the printer to make optimal use of its limited 
nie ni oiy reso i j r< ■ es . 

DRAM Coiitrii lie r/Ar biter 

The external DRAM is connected to its own nibble-wide bus. 
Hardware arhitmles aec€*sses tu the DRAM l)t'^twefn the 
i/(> DMA hardwiut', die s*'niitil iiiUihviLre, and the tnicrti- 
pix^cesstjr llu* ju bit ration method is a tM:>ml)uiation of f>rirmty 
mid romiil-robin schenu^s. Both die PO anti dte seivanl hard- 
ware processc*s have real-time constraints that dictate the 
nmxinmiii length of time Ihv'y can he hlotked while wailing 
for at/c'i'ss \f} the DKAAl AJthongli llu^ uiirrcijinKi^ssur is U'ss 
lightly constraintHl, h is intpoitant tlial it not be completely 
locked out of the DRAM for extended jieriods of time- 
Hence, while each l>lock h^is a priority for DRAM accesses, 
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the liardware is designed so that no one unit cam hold the 
DRAM bus continuously. 

In addition to arbitration, the DRAM controller takes care 
of the low-le\ el mterface to the DRAM. It iiiteif aces the 4-bit 
DRAM data bus to the 8-bit microprocessor data bus. Using 
fasl page mode accesses, it retrieves two nibbles and con- 
catenates theiTi into a byte. It guarantees that ttie DRAM 
refreshes take place al appropriate tinies. Altliough only a 
IM-bit part, is used in the HP Desklet 820C, the controller 
also supports 4M-l>it DRAMs. 

Servant 

One of the key contributions of the PPA architecture is tliat 
it moves much of the pixel processing into the driver. The 
image data sent to the printer is in a format nearly ready to 
be used to lire tlie pens. Tlie only significant operation that 
is not done by the driver is the operation of picking out the 
inciixidual bits (corresponding to dots on tl\e paper), antl 
sending them to the pens in the correct order The sen^jmt 
logic, so named becau.se it series the i>eii hy providing it 
with pixel data^ accomplishes this by loading the data into 
an on-chip cache (the SRAM), and subsequently pulling it 
out at tlie correct tmie and in tlie correct order. The cache 
is divided into sets of swing buffer paiis (one pair for each 
color) such that w^hile data is being taken out of one sv^ing 
buffer by the pixel processor logic, new data is being loaded 
into the other sviqng buffer by the serv^ant load logic. When 
the pixel processor consimies all the data in one swing buffer 
and switches to the other swing buffer, the scnant load 
process i^egins loading new data into the first buffer. This 
process is depicted in Fig. 4. 

The PPA driver provides the pixel data in swing buffer loads, 
which are chunks of a bitmap eight pixels mde and the same 
heiglit as the printer's pens. The driver provides the swing 
buffer loads in exacdy the order reciulrefi Ijy 1 be pens. The 
ser\ ant load process transfers the data by DMA. fiom the 
DRAJVl to the SRAM as it is needed to fire the pens. During 
the process of moving data from the DRAM to the SRAJVI, 
the data is decompressed if it was sent over the I/O in a 
compressed format. 



Fig. 4. LS^viiig ijulfer operation. 

SRAM Arbiter 

The SRAM arbiter arbitrates memory requests between the 
servant load process, the pixel processor, and the micro- 
processor. T!ie arbiter implements a priority-based scheme. 
Since the microprocessor accesses the SRAM only jntre- 
QUently, i1 is given the lowest priority. On the other hand, 
data for the pen must be immediately available orj demand 
(the cmTiage caitnot be paiLsed for the pen to wait for data). 
Hence, the pixel processor i.s given the highest priority. The 
serv'ant load process lias the middle prioiity. 

Pixel Processor 

\\\e pixel t^rocessor is responsible for placing the bits sent 
to the pens in exactly (he order in which they are needed to 
hre the pens. Since the nozzles on tlie pen are staggered, tlie 
order in which the bits are needed is not entirely straight- 
fonA'ard. As the connect bits arc pulled out of SRjVM, they are 
placed in a shift register from which they are serially shifted 
to the pen driver IC on the carriage board. 

Each bit sent to the pens corresponds to a nozzle firing or not 
firing. Firing diita sits in the SRAM in byle-wide chunks. Each 
byte corresponds to eight colunms of dots on the printed 
page. All dots in a cokunn are fired before beginning to fire 
the next column, flence each byte in a swittg buffer is ac- 
cessed eigJil times, tmce for each column. Alter all eigltt col- 
umns are fired, the logic switches to tlie other swing buffer. 

Pen Interface 

Tills block communicates with the analog pen driver IC over 
a custom serial interface- The pen ij it erf ace receives data 
from the pixel processhig block ami shift.s it serially to the 
pen driver IC. It also generates the liming pulses that the 
pen driver IC uses to fire the pens and put ink dot^t on the 
page. In addition to sending pen firing data, tiie interface 
sends setup iiifonnation to the pen driver IC ttj atfjusi vari- 
ous priiUingpanimetcrs that Jiffect ptini quality. Tlie serial 
interface is bidirectional, enabling the pen driver IC to send 
back infonnation about the pens' status. For example, the 
pen drive iC is able to measure temperatures of the pens, 
which are important parameters in thermal Inkjet printing. 
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Tlib information is sent to the digital IC aiid read by tlie 
fintiware. Hnnware t!it*n uses this ijifomiaUon to actiiist 
priiuiiig u> ensure that the castomer will receive optimum 
priiit quality. 

Because of the staggering of the nobles on the priiithead 
(see Fig. 1 on p^ige 40). for each cohmni of dots on the page, 
the pen iTiust be fireci nniltiple tiinea Tlie pens must be fired 
eveo' tJJit^ ii ^^^ *>f vertically aligned no7,zles is at Lhe correct 
position on lhe page. If all noxiiles in a dot column are not 
firetl at iht* sanie physical position on the t^age. that column 
win appear jagged to the customer. Special logic in the chip 
eiisiu^es the proper alignment of the dots and therefore opti- 
mum print quality". Tlus logic uses tht* carriage position its 
determined directly from the optical encoder, which Is at a 
rehitively low resolution, ant! interpolates it up to the resolu- 
tion needed to fiie the pens. Tlie interj>olation is done by 
t>hase-locked-loop-like logic tliat measures the time it takes 
the carnage to move 1/150 inch, iuid diiides this time down 
to get the lime it takes the cmriage to move a <lisl<uiee equal 
to the nozzle stagger distance. By doini^ this, the logic is al)le 
Nj issue firing pulses to the pen at Lhe coirect time. 

Development Methodology 

The HP Desklei 820 C was developed under some ver> light 
time-to-market constraints. These constraints dictated that 
the latest C'AE tools be used to speed the development of 
the ASIC, Additionally, the projec^t team ^visited to Itave eon- 
ciuTcnt design of the h^u'dwarc and the finuware. 1'his UH'mii 
that the firmware tetun needed a plalfor[n on wliich to do 
develojjmenl before the ASIC was finished. To meet this 
need, Aptix hardware emulators were used. 

The HP DeskJet 820C ASIC vvtis designed entirely in the 
Verilog llmciware Desciiption Umgua^e (HDL), HDI^s are 
computer languages uslh\ Io describe digital c ire nils. They 
contain constructs thai allow dt^signeii> lo describe tlie fimc- 
don of a circuit rather than the exact gates that are necessary 
to implement that funclion, Tlius, UDUs allow di^si^iiei^ to 
work i\\ ix hi^luM^ k^vel ritabslrarlion th^m iji Ihc^ pasK Once 
a desigru^r has written the DDL for a circuit, a compiler 
program can syTithesize the HDL into a gate-level tlesign. By 
Wi>rkinfi at a higher level of abstraction, engineers tran greatly 
incieast^ their productivily The lime retjuired to df» the design 
of the HP DeskJet S2hr ASIC was significaiUJy decreased 
over past products. 

Since desigJiiug an ASIC using an HDL is analogous to writitig 
a piece or software, it is not surprisinj^ timl many of the 
jn act ices used by soft^vare enjsjineers can l)t^ used success 
lidly by harrlware teams using IIDLs, At the l)cgiiirvint; of the 
j>roje<1f < fjdjng conventions were established. SiinJUu' .stnic- 
tures in different designers' modules were coded shnilarly. 
I.>csi]^^ners were enconragefi \o commt^nt their code filjerally. 
f <ide reviews were held during the projecl to rind enorsajul 
to improve designers* coding t>ract ices. These techiUfjtics 
allowi^d (lesigners to look at each others code iuul guickly 
tuiderstatul it, hi addition to having obvious benefits for the 
IIP D(\skJct 820C project, the gtiod coding prat^jces will 
;illrm' (he IIP Desk.letH20C hardware tt> be t^asily h-veraged 
mh Huture products. 

Trj synthcHize tlu* Verilf>g crnie intf> standard vvW gj!ti\s, Syrv 
ojjsys software w lis used. Tliis software allows tlie designer 



to enter informaiion about the design to help the software 
produce an optimum implementation, S>Tiop.^^specific* 
scripts were used to enter tJie required infonnatioii. By 
using scripts, the designers w^ere able to make changes to 
lhe cocle. and with mininiuni effort, s>Tithesi^e the new im- 
plementadon. Just as for the original Veiliog code, conven- 
tions and templates for the scripts were developed. As a 
result of tliese techniques, in a few special cases engineers 
were able to modifj' code writlen i>y a different designer and 
s^iithesize nevr Juurlware veo' effifienfly. 

Ai\ important part of ,4S1C design is test developraenl. The 
HP Desklet 820C ASIC design team useti an HP proi>rietary 
techtwlog>^ tiiat aUowed the engineers io write test \'ectors 
directly in Verilog. These test vectors were iLsed to verify 
that the fmictionality of the synthesized design matche<l the 
original Verilog. The same \ ectors were then tjanslated into 
a format tjie ASIC tester understood, and used to test Uie 
Hnished silicon. Using this technique, a shigle set of test 
vectors was used throughout the pnjject. In addition to the 
functional test vectors, scan testing was iLsed to acliieve tiie 
desued fault coverage. Since the insertion of scan liai'dware 
and the creation of scan test vectors is done semiautomatic 
cally. Si" an testing was successfully added to the ASIC with- 
out incurring a schedule delay. The use of scan testing iM 
increase the cost of the chip because the scan circuitry 
caused the chip size to grow, but this was deemed accept- 
able when iradeil off against the time it w^ouid have taken 
the designers to write functional test vectors witJa adequate 
coverage. 

Hardware/Software Codesign 

To in eel lhe overall prtjject goal of a low-cost product^ it was 
necessary \o make cm'eful traile-offs het\\'etMi the hardware 
and firmware in the product. Functions that are realii^etl in 
htirtiware cost nioney because silicon real estiite is used. 
Puttctions reali:«ed in tlrmw-me cost nujncy because they 
require i>its in memory'. Suyce the ROM tluit holds the 11 rm- 
ware in the HP DeskJet 820( is integraterl into the system 
ASIC, its size hati a luu-d ujjper limit Also, the HP DeskJet 
82flC uses a relatively kjw-pr>wTr processor* so ttie process- 
ing Ijaudwidth available topertbnn futictions in real lime is 
Umited, With the standard cell logic, tli<* pi <h lessor, :md the 
fi mi ware ROM all integrated onto the same vh\[h optimal 
Irade-offs between the three were especially importaiU . 

To make the correct trade-offs, the ASIC engineers responsi- 
ble for parHcular hardw^u^e blocks coordinated closely with 
the finuware engineers responsible for the corresponding 
finnware blocks. This process billowed the lumlwiireengi- 
tieers to gain insight into how the firmware would use the 
block, iuid at the same time aJknved the firntw^are engineers 
Io have a good miderstundhig of the hardware, This umtual 
underslaridiiig led to better tra<kM)f Is. The harrlwrnc w^ls 
designed wit Ii. just enough funttionality to allow tlu* rirm- 
ware designer to ht^plenient the code within the product 
codesl^e ruul processor bmid width con,straints, but without 
a lot of extra hardware thrown in 'tjust in case," A set (Jitdary 
benefit of this approach wiis that the finuware engineers 
were able lo write the code for blcjcks designed tins way 
with few pn>hlems. Code for blocks not ilesigned using this 
process Iprimrmly blocks U^veraged froTn prt*vi(jns|)roducLs) 
proved tmu'h more (irohlematic Io Imng up. 
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ASIC EnmlatlDn 

Tf> meet the aggresjsive schpcJule, it was iiecfssary for the 
finiiware teani to begin inrplententing tjie code well before 
ASIC's were available rn nin the code. In tad. iimiwiue miple- 
rnerttalion bc^gan before the ASJC tief^ign was even eonijjleu^ 
T() allow this iuiivity ro take plaee. jl was necessaiy rn set 
up an enuilalion envirormvenl. TradilioiiaJ methtKls of doing 
such eniulation izK'Uuie huilcling a eitstom [ninted eir4:niii 
boai'd populated with one-tuiie-progranunable devices (aiiti- 
fuse devices, l;i.ser [jrograniraed pails, etc.) ^iiid .software- 
btLseti ojuiilation using a pre\itRis ]>roduet. Since the digits 
arcliitccture for this ASK' w;ls a signifit imi det>m1ure frojii 
any previous producf. emulation on a previous poKiuct 
would have been difficuK ai best and would have required a 
significant code jjoil vvht^n tiie ASK' bectuire available. One- 
tinie-i>rogiit[niHabie devices were not flexible enougii to sup- 
port ernulation before the design was functionally CT)niplete. 
For these retisons, the product team chose to use full-rbi]) 
IC emulatois frrmi Aptix lo support tlrmwar^ developiiieiit 
before ASICs were luailable. 

rc emulators are essi^ntially a large airay of SILAM-based 
FP(tAs (fiekl ijrof^riuniuable gate arrays ) with progranmiable 
inter(*orniect between (henr Software reads in agate-level 
desigji f(jr an li\ partibons i1 into the FT*jAs. and then 
Cleat es all the ueee.ssar^^ files It) i.irograni the F'PGAs and Ihe 
intercoiuiect Ijctween tiienu 'fire enujlator then is funclionalty 
cHjuivalent to die IC that will l>e fabricated, lite emulators mv 
highly flexible since they can lie I'eprng rammed by simply 
downloading a Jiew pattern hito the SRAMs. The juain flraw- 
back of the It' emulators is tiiat they generally iwo not ai)le to 
run at thi^ same speed im ihv filial silicon. For this product, 
the emulator rati at one fourth of the final cltjck speed. 

Using this approach, the IC team was al>le to provide tlie 
fi^rmware team with usai»le hardware apjiroxiiualely Tour 
months bt^fore silicon arrived. Since the ASK d(^sign was 
not complete at that jjoint, the first hmtlwme t)ro\1ded Wius 
only a subset of the full standard cell logic. Wliat wiis pro- 
vided was enough f*>r the firmware team to begin waiting 
and testing the oi>eratirtg system, rbe code that needed lo 
be \sTitten first. As more bhxks in tire IC" were completed, 
they were incorporated into the emulation sy.stem. 

In addition t(j providing early hardware t^o the finnware team 
for development pmijoses, the use tjf IV ennilators allowed 
the hardwai^e to be verified in tlie bill print big system with 
actual finnware before being com mil I im I li» siiicou. Because 
the enui tutors ran at ckisv lo the sy stent speed. se\eral 
orders of magnitude more clocks cycles of verification 
occured on the emulatoj.s iban with software simulaf ion. 
Also, since it was real finnware nmiung, the ASK^ w;i.s |nii 
m stales that would have been difficult or impossible to 
achieve jii sjruulalion Ijecause of the ct^mplexity of getting 
mto diat state. Finally, many imiintit: ipaled hardware/fimr- 
w^are mteractions were disco\'ered. The team was e%^entually 
able to print with the IC emulators, gi^ lug \'ei>' high confi- 
dence in t!ie fimctional corTectness of the ASIC. 

Thanks to the emulators, two problems in the design were 
discovei-ed and r]xed befoie connuitbng the design to silicon. 
Both problenrs were system htteraction issues tJiat would 
have been ver>- lUfficult to discover tbrongli sitnulatioo alone. 
Wiensilii-on airived, tmnware was almost innuediately 
bootable on it. The raily things that needed lo l)e changed 



in the firmware were things that were ;iffected by the differ- 
ence in clock speed, and these had been deliberately coded 
to be easy to cbaiige. 

Regii lal orj' Req ii i re me iits 

Because the IIP EN\skIet 82fiC printer is sf>ld in the crjnsmner 
marketj)iace it nnist nu^el alJ applicaljle consutoer electronic 
regulati(jns. Of paiticular interest to the electronic design of 
the profha t are the electromagnetic interference f KMI) and 
elect fostutit^ dfscharge [E,Srj] requirements. h'Ml occurs 
when an electronic product creates an electric field and 
interferes with the correct operation of another elechxmic 
IjrochH t. ESD occurs when an object (generally a human) 
that has biuit u]i a large static cliarge disctuirges to a second 
oljject (for exmnijle, an I C), In addition to government 
rei|uiiements on a i)roduct*s level of EMI atrd sejrsitivity to 
ESI), HI* jnaiotalns internal stmidards, which are generally 
tougher I ban the gn\ enmieni sJandards. Meeting or exceed- 
ing these inti^nial standards on eveiy product is mi imixatant 
aspect of Hi*"s repiilatiort for bigb-quality^ reliable |jroducts. 
Since the HP DeskJet 820C coidd not legally be released 
witlioul ineetiug govennneni regulatitms (Ui EMI and ESD, 
taduie lo iiiecl I hem was a signiln ant scliedule risk to tlie 
ju'odm I. There fori', both were addressed early during the 
design of die digital ASK\ 

In general, EMI results from improirerly controllc<l biglv 
frequency signals that travel a loiig chstaiK-e, particularly 
signals 1 fiat, ii'avel over cables. The I rick in designing for 
red need EMI is to control the signals with high-frequency 
content to the greatest possible extent. All 1/0 pads in the 
HP DeskJet 820C m;ike use of an HP proprietary technology 
that ci}nn>ensates for process, voltage, mid tenijjerature 
{ PVT) vai iat ions in the oiierating environment of the c liitJ. 
Tlie cfmi|KnLsatlon ensures that the pads have nearly the 
same slew rate rc^gm'dless of the PVT cnviromnent. (TV]jically. 
parts in an envlronuTcnt tiiat causes the chip to rim fjist have 
about twice the slew rate of parts in an environment that 
causes tlie chip to run slow). Of particular concern in this 
[>rotluc! were tlie digital signals that travel between tlie main 
logic boai'd mid tiie carriage board. These signals, which are 
in the megahertz i^ange, travel at:>proximatc*ly 10 iirclies along 
an nnshielik^tl iuid untwisted Ilex cable. This was deemed 
the most EMl-T>rone i^icce of the design. The I/O pads that 
thove these sigujiils were designed to have as slow a slew rate 
iis the signal speed would allow. As a backup system, pliase- 
locked loop hardware and additional logic to dither the 
systeui clock or the signals on the Hex cable was designed 
into the AS1C\ The result f>r Ibis desigti efTort was successful 
|)assing of the EMI regidations on schedule. 

The other big regulatory threat, ESD, was recognized early 
in the project by both the designers antl the ASIC vendor 
(ICHD, a division of H!*). Because ESI) events can cause 
damage thai will nnl iminedialely destroy the chip but 
insteafi will cause it to fail moi\ths {>r e%en yeai-s later; inade- 
quate ESI) ]:jrotection i an result in product reUability tmd 
customer satisfaction problems do\\ii the road, The cliip 
nectled to be able tcj withstand ESD ev^ents both before and 
after heuigt)nt on a printed circuit buanf When the chip 
is on a printed circuit board, extenml components can be 
plact^d around the chip to help protect it. However, each 
component adtls cost to the proiluct. so integratuig protec- 
tion into the chip results in a cost sa\-ings. Al«io, although the 
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Fig, 5. HP I H^skJet mnv fligjtal roiilroller h^lC. 

chips will be in a relatively controUetl L^miroiiiiieiil: bel'ore 
being put onto the printed circuit lioard. ESD e%'ents cm\ arid 
wili occxiT, so the chip needs to be able to withstand them. 

Wlien an ESD event occurs at the chip pi^is, a large ciiirent 
between the dischaige point and gi'ound is induced in the 
ASIC. Most stnictiires internal to t lie ASIC cannot witfi^itiuui 
such a large current without danujge. Theretore, a cttii) de- 
signed to witlishmd ESD has striuliires Uiat i ati withstand 
the liigli cunenl, arul roules I lie current Irnhese ^lructure,s 
rather fhaii U) I he cliip inlenials. Shice ym ASK 's only cun- 
lac I !o ihe outside workl 1% thmugh its pads, EBD protection 



devices are generally located at the pads. During the design 
of the ASIC*, the ASIC vendor assigned a de^lirated engineer 
to evaluate the ESD design. A curreuMinxititig resistor and a 
rexerse- biased diode were used in each pad to ilniii the cur- 
retil tliai can reach die chip iniemals> Additionally, i*huni 
stnictures belween \ ^j and gojund were carefully designetl 
aiuJ positit^ne<t in I he IC. Tliii^ early invohement by the ven- 
fior resuJtefi in a soH<l design d^al meets HP ESD requiremenis. 

Conclusion 

By taking into account the IIP DeskJet 820Cs overall project 
goals of low cost and tune to nuirkt^t from the start, a well- 
optimized digital controller for the printer was dehvered on 
schtHlule. The chip is specifically designed for HPs Printing 
Performance Arcltitecture. By integrating all digital func- 
tions in the printer on a single [)iece of silictMu tlie cost of 
die electJtmics in the product was givatly reduced o\er the 
previous-generation product while niaintaining or mcreasing 
perfomiance. Tlie design team used tlie latest .'^SIC develop- 
ment tools to deliver a conectiy functioning ASIC on a very 
tight schedule, Tb rough the use of hardware enudators, the 
fin 11 ware team was ai>le to begin cixling before the final <*hii» 
desigt) bad l>een released for majiiifacturing, furlJier st)eeding 
tlie printer^s design. AH EMI iuid ESD requirements for the 
prodm t were met on schetlule. A lithograph of the final chip 
silicon is sho\^^l in Fig, 5. The chip area is approximately 
SI mm-. 
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Next-Generation Inkjet Printhead 
Drive Electronics 



By integrating the functions of four ICs into one new custom IC and then 
moving all the electronics related to the pens up to the carriage with the 
pens, significant savings were realized. A simpie, low-contact-count, 
inexpensive flexible cable is used to connect the carriage to the main 
printed circuit assembly. 

by Huston W. Rice 



The projecl leain fof the HP DeskJet 850C printer developed 
tlio inany elomeiits or the pnnting system in paiallel. In |)ai'- 
ticular, tiic jjrint cartridges (called pens) were new designs, 
along vs ith the ek^rtronies thai tontrol them. As a result of 
the pens being new designs, thtur clrivt^ ^uul coalinl lettuirt^- 
ments were not completely defined, bul were changing dur- 
ing the development progiani. The residt of tliis was a sys- 
tem that worked well eleilri(*aljy, but was no( fully 
optimized Ironi a cost stantlpoini , 

In particiilarj t:wo aspects of the pen drive system presented 
dpportLinities for significtmt cost redaction, f^r^^t, the flexible 
€able comiecthig the carriage mu] pens lo the nuiin printed 
circuit assembly was very elaborate and fairly expensive. 
Secontl, the electronics that control the pens were imp la- 
mented in four different analog ICs, three of them custom 
ASiCs. 

With the advantage of being able to look back on the now 
welRlefined system needs, a new approach was selt^led. 
By integrating ibe functions of the four ICs into one new 
custom IC *nid tlven numng all the electronics related to tbe 
pens 11]) tt) the carriage vvilh the pens, significinn savings 
were realized. The new, highl\" integrate*] ASK ' is less expen- 
sive to purchase and to assemble into the product. Smc:p the 
signals are restricted to digitiil data and raw power, a simple, 
low-contact-c'oiuil, inexpensive flexible cable is used to con- 
nect I lie caixiage io the main printed circuit assembly. 

For this design approach t<) he successful in the HP D€«iklet 
S20C, severiil issues had to be overcome. PVjr tlu* greatest 
benefit^ ali of the electronics associated with the the pens 
had to be contained on the carriage piinted circuit assembly. 
Would it all fit 7 Bei*ause of a tight schetiule imd limited me- 
chanical engintH^ ring staffing, no mechanical changes could 
be Riade ui the < an iage assembly to make nu«v rtjom. An 
arklitional mechanical cojtstraint was tliat no componetUs 
could be placed on the bottom lialf of tlie prmtetl circuit 
board, wiiich was needed for the connectoi"s for the pens. 
To get the circuits to fit, all the analog IC functions bad id 
be integrated into a smgle ASiC. Could all the different func- 
tions — power control, digital I/O, sensitive anaJog-ttxIigital 
measuremetitS) power drivers — be integrated into a single 
device? If all ilit^ an^Uog fimctions from four ICs were uv 
tegraled into one IC, would there be thennal overheating 
issues in the IC? Would there be problems with radiated 



electromagnetic emissions from the digital interface to tlie 
carriage over a siini>ie imshielded flexible cable? 

Ti» provide an aspect of excitement to the inxjgramj tnice 
this apt) roach was choseit, there was no easy altt^niailve to 
fall hack upon if tbe above issues could not be dealt with* 
If this design failed, the whole MP DeskJet H2i)V printer 
program would be put in Jeopardy. 

Carriage Electronics Implenientatioii 

A key ajx- hi lecture cliange was made in tbt^ iien drive and 
control electronics iji the HP Desklel B2t)C compared to die 
DeskJet 85(K\ Tlve power supply for the pens was inodifietl 
hi two ways. First, two intlependent dc-to-dc converters are 
used to sujii^ly power to the black and color pens in tbe 
Desk-let 8ri0t\ In the f)eskJel 820C, a single pen power sup- 
i:ily is used to dri^e both the black ami (*olor i>ens. Second, 
the conlrf>l topoh>g>' of the dc-to-dc converter was changed, 
as exi)laine<l later in this aitirle. The DeskJet 850C design 
reiiuires se\'en laige cafjaciiors, two inductors, two power 
FETs, two [K^wer diodes, and several small disci ete resistors 
and capacitors. Ah of this wtis replaced with I wo capacitors, 
one inductoi; one power F^ET, one power diode> and one 
power resistor This eliminates not only the need tor several 
scjua re inch es o f i ir i n t ed c i re id I I >o art I s] > i ire 1 1 1 a t was i \ ot 
available on the DeskJet 820C Ciirriage printed t:hcuii boaid, 
but also thecost of the im needed comjionenls. 

The two [leiis in the product (black and color) must be chiven 
at d iff ere [It voltages, and the DeskJet 820C design now only 
has one t>ower supply, which is shared i)etween the pens. 
This forcetl a cliange m ttie way ivrinting is done. In the fJesk- 
Jet 850( , both pens can be th iven at miy tinier allowing max- 
ununi Ocxibility in how the pHnred image can he foniied, imd 
!hereh)re maxlnnmi speed. In tlie Desklet 82()C\ printing 
with tlie black and color pens alternates. For instance, black 
may he tirintefl from right to left and tlien t*t>lor Irom left to 
right. This difference costs a little in print speed for some 
color documents, but w^as key in enabling all the electronics 
to fit on the c*arriage printed circttll assembly. 

Several teclmifiues were used to integrate all the pen elec- 
tronits onto tlie cmriage for the HP Desklet 820C, Beyond 
the jiower supply changes, the next most im]Jortant step 
was designing a mixed-signal analog/ihgilal/power ASIC that 
mtegiates all the functions reciuired to drive and control the 
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pens. The geneial stiBtegy was to integrate all the relatively 
small^gnal electronic functions into one .*VSIC to minimize 
the the tot^ ronirM>nenf counL Ttiis boih miiiiniizes the <'ost 
and uses the nuninmin printed circuit UikUii urea on the ven,' 
j^nmll carriage prmtjed Circuit assetubly. H<ivvever, to keep 
the ASIC silicon die area under c ontro! and to nunintize the 
total power dissipated by the ASIC, several key contixinents 
are not intej^rated, Tlie power FETT and diode for the dc-io-tk* 
com erter. both ven^ iar^e devices ( from a silicon area point 
of view) are implemented as discrete devices externally. 
IVo linear regulators are also implemented with oCF-die-shelf 
tliscrete devices to keep their power dissipation out of the 
ASIC package. Beyond these parts and some discrete capac- 
itors and inductors that cannot be integrated, everything else 
is internal to the .-XSIC, 

The process of developing the ASIC vvius the tnosl difncnll 
cLSt)ect of the carriage electronics design. Because of the 
higli expei*tc»d production volumes, at least two judepenilent 
suppliers were needed. In the special mixed-signal/povvtT IC 
industry, there is considerable process variation from one 
supplier to the next. However, only pin compatibility is 
required between sources. The two ICs do not have to he 
identical Over a i>enod of about six months, the analog 
ASIC was c t)d eve i oped by UP ajid the two suppi lei's, Tliis 
allowed system design trade-offs to be made to keep both 
ASICs compatible. In addition, the ovemll program schedule 
demaiuled that the Omt jKiss of Ibis full cnstom IC* had lo 
work, because there was only linie for si nail re\ isions to the 
device before production simted oti the HP licskJet H2{)L\ 
As a result of excellent design tean^s at the stipidiers and I he 
careful c*>development t'ommmiicatioji between HPmul the 
supjiliers, I he llrst samiiles of the ASIC s w^orked with only 
a I'pw faults. Simjjle ]V mask changes ;md the addition of a 
few^ small exie^mal components resulteti in the systeni being 
completed on time. 

hi acidition to the mixed-signal ASIC', three prit^ted circuit 
board layout Jechniqiies were used loget iill duM oni|)o[u^iits 
to lit. First, a -ID uiaji of available space for t (Jinponenis on 
the circuit board w^is generated. With this, sm;UI [Kirts ctjuld 
be tuckid under meehanical comtjonerits, au<l tlie larger 
cotnponents (*ould lie t^suefully |>lace<l to avoid ittecrhanit^al 
interferenci^. SiM'ond, the t)Iacenieut of the<-otiijjorH*nts was 
wry caiefully desigjved to minimize intercom lect distances 
and the nuiuber of vias requiretl. Tliird, the classical iayonl 
placement design niles tliat govern compiment st>ac irigs 
were pusfierl or outright violated. Breaking the rules was 
justified because the alternative was chajiging tf) a two- 
sided surface mount assemi>ly process, which is a nmch 
more expensive and tuiattractive alternative. 

In lite eitd, all the parts were niadf* to fit on the top side of 
(be printed circuh board. An athied benefit of the careful 
printed circull board design is a very low-noise circuit. 
During the develojinu'Mf pnn ess, wt* tliscovtTed matiy of the 
high'cunent swhcliing circuits interfered with the sensitive 
measurement circuits. The cotnpaet layout provided signifi- 
cantly better tjerformatiee than earlier i>rototy|)e lay<juts. 

As might be eKi:>ected, the km -noise layout is also low-noise 
from an eli/(iroinagm*tic radiation point of \ir»w, Stet>s taken 
(ocntilrol du/slew ratrsof Ibe rfij^iial sij^nalson flic tlexibli^ 
r:ible also pjoved effective in miniuiizing radiated cnussiojis 
from the cal>le. 



Finally, the steps taken to minimize the power dissipation in 
the nuxed*signal ASIC w^ere successful, lo the point that the 
IC operates at junction temperatures less than IO(J-C under 
even the most extreme conditions. 

Pen Operation 

The black mid c<jk>r pens in the HP DeskJet 8CX) fiunilv of 
printers operate as a matcluHl jiair to deliver lugli-t|uality 
color docimienLs. The pens themselves are in manv" senses 
the heart of the printer, and all of the elec-tricai and mechan- 
ical systems are designed to support and optimize their per- 
formance. Tlte electriciil systems that drive and control die 
pens accomplish two major tasks: diey maintitin the temper- 
ature of the printheads to optimize the \mt\i qualify, and they 
drive the correct inKjet nozzles at the right times to print the 
desirefi image on the paper. 

The viscosity of the iixk m the pens is sensitive to tenijjera- 

ture. aiMl the size of the drops ejected by the pens is sensi- 
tive to iuk viscosily. By t^ontrfjliii^g tlie temperjiture of the 
pens, the viscosity and therefore the drop size can he con- 
trolled. Consistently sizetl droi>s provicie the best print quality. 
Integrated into the pens is a temperature sensor, which can 
be used to measiire and control the temperature of the pens, 
m\(l iheiTfore tlie irvk viscosity and drop volume. 

In previous generations of inlget pens, the task of driving 
the pen nozzles has been fairly straightfoi'w^ard. Tlie nozzles 
w^ere arranged into colunms on the pens, and every nozzle 
(or firing resistor) was controlled and drivf^i diiectty. The 
pens in tiie HP DeskJet 82tH ' printer have -M) nozzk*s (black 
lien J or 192 nozzles (color peu^ (>4 nozzles for each color}- 
This high no5^1e count, makes it iittpossible to drive each 
noijzle directly witli a dedicated sigruil ami intercontiectJotK 
For t b ese high- n ( mAv -co u j 1 1 [ ) e n s , a i uai ri x d ri ve t ec i u i i( j u e 
is used. Tlie metlKjd is die saiiu^ for the black and color 
peirs. The matrix diivi^ has two benefits. First, the ninnber 
of connections lf> tlu^ peas is now much lower 22 addresses 
ajid IG colunms cmi select 22 x 14 = 308 nuy./Avs. Tlie con- 
nection iount is 22 -H 14 + 14 = ^iO. (The sei uiul 14 connec- 
tions fre for the rohmin grountl retuni currents.) Second, 
for power reasons, all the nozzles cannot he fired at one 
time. For each nozzle, a 2. 5- [is firing jjulse is applietl to tlie 
mizzle resistor that boils the ink and ejt^cts the drop. The 
pulse voltage* is ab(Hit 10\' and the current is 25i) niA* If Ml 
30<) nozzles were driven at the same time, a total current of 
75A at UN would have to be avaiiabJe! Tins is imiJractical. 
For power reasons alom\ all the nozzles cannot be fired at 
once, and the matrix driv*^ provirles acojiVeiiicni way to 
distribute firing the rmzzles in time. 

The i>en is electricrahy constntcted as a series of 22 address 
inputs driving FETs in the mws of the matrix and 14 primi- 
tive infMits (iriving the firing resistors in the colunms of the 
matrix. Inside tlu- pvu are seliHtitin FFTs for each nozzle 
resistor; these can enalile or disable a given tiozzle. 

The i>en is rlriveu one uddress row at a time. First, address 
one is driven, turning on the selection FETs for the tot> njw 
of 14 nozzle firing resistors, Any of the 14 primitives art^ then 
driven (silL a few, cir none) with the t>reviousIy menlicnKHl 
lOV, 2rjO-JiL\ pulse to fue the desiied ntizzles for t^acli n Juuin 
asscK iateil witli row 1. Addic^ss one is then turned otf and 
aiidress twti is dtivini, selecting the next r4>w of FE,Th and 
nozzle ix*sjst()rs. The desired iirimitives are agahi diiveit, but 
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this lini^ firing nozzles associated with row 2. TTiis process 
b c oiitiiuied through adfiress 22 anti llicn repeated. By se- 
qtiencmg througii all 22 addresses, every one of the ;300 
nozzles c:a J 1 f)e selected and driven. fNote: 22 addresses 
times 14 pnniiiives yichis :I08 iHJTeniial jiozzies. Shicethe 
[jet I only has 'MH) nozzles, eiglit (jI' Ihe combinations do not 
have a selection FET or nozzle resiston) 

Mechanically, the pen nozzles are aixangcd in a patieni to 
generate proper intages, even though all the nozzles arc* not 
fned at the sjnne time. Tlie hlack pen in the IIP DeskJet 
820r is ca[)al>le ot6d()-dpi printing. The print swath (rhe 
band (jfinli jniiitetl in one pass) is 1/2 iru-h Ingli, ajid the 
columns of dots in ihe swath are l/OUO inclr apart- 

A sintple example will iliustiate how tiie nozzles are ar- 
ranged. Suppose we want to priait a vertical line, 1/600 inch 
(one <lol ) wide. If the pc^n were eonstnicted to fire all the 
noz/Jt\s at the same time 1o print a vertical colunnu Ihe 
noz/les would be aiTanged in a veitical line un tiie pen. For 
power reasons, the nozzles are tired in 22 different gronps of 
J 4 (the 22 addresses and 14 primitives), antl these aie not all 
driven at tiie same time. Since die [sen is contmuously mo\ing 
while the* nozzles aie fned, the desired vertical line would 
c:ome out jagged or sloped if the noiszles were arranged in a 
straight line on the pen. To get a straight line on the page, 
the nuKzIes aie staggered to comi>ensate lor die timing dif- 
ferences of the firing (see Fig. 1), 

There is one additional complif athig fiicton The f>O0-dpi 
black f)en has the nozzles anaoged in (wo groups, (jdd 
nozzles in one column (with some stagger to compensale for 
tbe filing rimuig), and even nozzles in another cohimn about 
4 mm away. Two nozzle columns allow l/-i(K)-in( li spacing 
b et w ee n ii ozz 1 vs i n a give n co hi mn lat h er t han I /t>0 04 1 1 c h , 
making tiie i>en e^isier to mai\ufactme. 

Now, to print a vertical, one-dot- wide line on tlie page, the 
odd nozzles are first driven^ one address group at a time. 
Some time later, after the pen/carriage assemhiy has nio\'ed 
4 mm and the even [lozzles are o\'er the same location on 
the i>aper, the even nozzles iiXG driven, one address group at 
a time. 

The implication is that the data sent to tlie pens tynisl l>e 
seQuenced pr'o]jerly to compensate for the nozzles being 
fired in 22 different address groups and also for the 4-inrn 
odd/even no^/Je spachig on the t^en. 

The color pen is constructed in a sinnlar maimer. Just Uke 
ihe biack pen. the nozzles are fin^ii one address at a time 
anrl art^ staggered on the jjen to comijensate fur this, Tlie 
fii*^t to of the 22 hlack address lines are shared between the 
black and color pens, while the color pen has its owti uniciue 
12 primitive lines. The 16 addresses times 12 primitives me 
sufficient to drive die 192 color nozzles, 04 per color. Like 
the black pen, the nozzles of the thiee colors aie placed in 
dual odd and even columns, Be1:ween all die colors and the 
odd mid even columns there are six different color nozzle 
placement columns, all witii small-scale stagger. Tlierefore, 
for a color document, the sequencing of data is even more 
complex liian for black piintuig. The data has to be timed 
for the address and row sequential fuing, separateil for odd 
and even nozzle cohmms, and timed to t^ompensate for llie 
displacement of the three colors with respect to each other. 
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Pig, 1. Black pf^n isnzzli' |)ki('orrn'nt. 

The task of sequeticing the data for the pen iioKzk^s was 
traditionally handled by the t'l inter digiial (Hectronics. In the 
HP DeskJet 820t\ the PiiMiing I^erformain e Architecture 
(PPA) mipi erne 11 tat ion moves the clata sef|uencing task to 
tfie host PC ;md dri\ er, where a lot of data processing wai> 
alreac!y benig flone. This relieves tlie printer of the burflen Of 
this data sequenruig task, and allovi's it to simply drive the 
nozzles selected by the data coming into the piinter. 

Pen Drive Electronics Functions 

The pen contrtjl ami (hive elect rr a lies have tAvo key t^isks. 
Tliey pro%ide the driving signals to eject the ink from the 
pens, and they pro\4de a temi>eratiire ccaitrol system to 
maintain a constant temperature m the active area of die 
pen. To accomtdish this, the overall cairiage electronics 
system has the following functions, most of which are inte- 
grated into the Ttiixecbsignal ASIC (see Fig. 2): 
IVo-way tiigital interiace bet^veen the pen chive electronics 
and t he main digital controller ASIC ui the printer (see ar- 
ticle, iJage ■31). Data is sent to the carriage to conti'ol which 
pen nozzles to fire during t>rin!ing and u> give control com- 
mancLs for die analog-to-digital converter (AI)C), pen power 
supply, and other circuits. Pen measurements made by the 
mixed-signal ASIC aie sent to die digital ASIC. 
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• 22 address drivers to provide the 121* signals to turn on the 
reT% inside the iiens and select the corret t mw to \k* driven. 
Thest* ciri\"erN are shared l>y the blat-k and <'olor pens, 

• 26 coliunn drivers to pnn-ide the higJt-c'urreni drive that 
fires the ink out of the ijens. 14 driA ers are used by the bla^k 
pen aii<i 12 by the color pen. 

• A 30W. programnial>le dr-to-dr (ijiivener to prmide preei- 
Sion pow er for ttie toluniii driver signals. 

• Two leniperature t/onirol systenis. one for each pen. This 
consists of an ADC to niake calibration measureinenis on 
the pen temperaliire sensor, DACs to set largel tempera- 
tures for the pens. ;uui control conipanitors ajtd logic to 
inipleinetil ihe reriipei-ature contiol system. 

• Electronics in measure the pen Eiing resistors, to determine 
if the pen is daiiiaged. 

• Circuit r>' to jii'^^^'ide thermal protection to the analog ASIC 
and resetting functions. 

Tlie overall sysieni t>rovides sill of f he rneaiLs necessary for 
the digital ASIC and nnn\\ are to control tiie i^ens, l>oth to 
maintain tiieir target temperature, and to drive speciTic 
i\ozzles to pnni ifie images desire (i by customers. 

DC-to-DC Converter Design 

The dC'to-dc converter thai provider regulated power to 
drive the pens uses a new digital control technique. The 
feedhark control of the regulator is a simj)le. piu-ely digital 
system. If Uie voltage is too low, it tunis on the regulator to 
full power and charges the Inilk-storagc fiiter capacitor to 
tjie target voltage as fast as possible. If the voltage is high, it 
turns off tlie regulator comt>letely Fig- 3 is a block diagram 
of the converter. 



Fitj. 2. Block diagram ariiif> mixed -sigrml ASK*. 




Fig. 3. Uhnk tliagrain of thp dc- 
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This fontroi technitjue has several afivantagcs: 

• The t;ontroJ system is v^ery simple and is easily iiitegrateil, 

■ The control system is inherently very stable and does not 
ret juire additional compensation components or con I rolled 
values for the jtidmior aiifl capacitor. 

• Tlie effective l>andwidth of dip regnl;^tor is very ini^h, so i1 
responds to rhang(\s in tlie load vei^ quickly. Since (lie kjad 
placed oti tite regulator by the |:>ens can change front to '3A 
in less than 100 iis. fast response time is useful 

■ As a result of the inl\orent stability ajid fast banci width, the 
size of the bulk storage capacitor could lie reduced. Only 
one cai)acitor is needed in the IIP Desk-Jet 820C design 
where three identical parts were used in the D€\sk,let 850t" 
design. Tliis was a key contriliulor lo the go^U of getting 
eveiytJiuig to lit on ilie carriage ijrinted circuit boairl. 

• Additional simj>le digital control fimctions, such as overcur- 
rent and undeivoltage shutdowns, were easily integrated. 

Beyond the savings in compottent s, t he bigi^esl benefit tliat 
til is control topologv' presented was design flexibility. The 
definition of the ntixed-Higiiril ASK\ which contains the con- 
troller for the rpgulapjr, iiad to lie tin^dized luontlis before 
any testing could be statted. By extiemally generating the 
clock for the regulator In the main digital ASIC under firm- 
ware control changes could be made to the regulator in 
softwavf'. up to the day primer |)rofiuction began. 'IVo of the 
key t>arameters in die design cjf any s wit citing power supi^ly 
are the switching frequency and liie maxintiiin duty cycle. 
By moving the generation of tht^ ckjck frequency <\\\i\ duty 
cycle to the linn ware iind cUgitai haidwaiet tiie final deeisicjn 



on the clock tiaranietcrs could be delayed until the systetn 
WiLS carefully tested and iinalyzed. For instanc e, as changes 
were made lo the printed circuit board layout, the clock wiis 
fined lined to compensate for the differences in performance 
I hut were seen. The clock ciin even be dynamically modilied 
to j>rovide rt^gulator behavior to match the pnnter operation 
mode at any given tmie. 

The net result is very successfuL 11 lu piogiaiiimable clock 
was used to time the regulator petfontiance to match the 
system m^ed iifier many firototyiJes had been built and char- 
acterized. The r e g 1 1 1 at o r s w i I c 1 1 es 3( I W of i ) o we i f r rm i a 
poorly regulatetl 18V supiily to a very^ welhregulatr^d, pro- 
grammable voltage apt)rot)riate for eitlier the black pen or 
the color pen. The regulator only uses about about Ci cm- 
( — 1 in^) of ijrinted circuh board iirea. 

Ac know le dgm en \m 

-John Widder imd (leorge Bai behenn, both of the HP DeskJet 
850C develo|)ment team, provided vital consulting on the 
design of the pen drive and control systems. Mark Tliackray 
and Leaiin MacMillian worked on the development and 
implemc^ntafion of the serial digital interface for the mixed- 
signal .AuSlC. Mark Garboden mid Carl Tlioinpsou wrole the 
Onuwiue that contitiis all of the cairiage elect njiiitrs and 
iutegrated the tissoiled electronics components into a cfjm- 
[jlete pen control system. Steve Stemple, the manufacturing 
support engineer foi* the IIP DeskJet S20C\ provided endless 
testing and design verification throughout the prototyping 
phases of the project* 
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The PA 7300LC Microprocessor: A 
Highly Integrated System on a Chip 

A collection of design objectives targeted for low-end systems and ttie 
legacy of an eartier microprocessor, which was designed for high-volume 
cost-sensitive products, guided the development of the PA 7300LC 
processor, 

by Terry W. Blanchard and Paul G. Tobiii 



In the process of do vel oping a nii<Topnj<'4^sst)r, key decisioi^s 
I iV ^iiitliiig piinriples must he estahlLshed to stM I he boiirKi- 
;tries for all design r ItH-isiojis. Tliese gtiiding [jrinriples are 
{le% eloped through analysis of mai'keiing. l>usiiiess. and 
technical requirements. 

Several years ago, we determined that we could best nieel 
the needs of higher-voUinie mitl more cost -Hensi live [jrodnctis 
l>y fievelojilng a dilTeivnt set of ("l*i s tiinerl lo the siiecial 
leiiuiremenis of these low-(^nd, midraiige systems. The 
PA 7I()()LC v^as the tlrst pnu-essor in this line, which con- 
Unues witJi tiie PA 7300LC\ 

This arri< Je will review the guiding imneiple^ used during tlte 
di^velopment of t J it* PA 73 DO I. C niicroiirtK*es,sor. A l)rief over- 
view of file cliip will als(j he given. Tin* other PA 7'KIQLC 
articles ineluded in I his issue will dest^nbe TJie technical 
contributions (jf the PA 7:i0t)LC^ jji detail 

Design Otjjertives 

Alt In nigh \hv PA T-'JOOLC was targeted lor tow-end systems, 
cost, perfonnance. ptjwer, and other design objectives were 
all given high prioriiy. With the design objectives for the 
rA7:iOtJLt' wewanJedIo; 

• Optimixe birenti^'-level through mith'ange higli -volume 
syst en vs (workstations and servers] 

• Provide exceptional system price and perfoniiance 

• Ronglily double tlie performance oft fie PA TIOOLf 

• Provide a high le\ t'l of integration and e^ise of system design 

• Provide a higlily t t>rifigurable ami sealal>le system for a 
broad ratige of sy.slejii contlgu rations 

• Tune for ivaJ-world ajjplications jiiid tieeda not jtist bench- 
marks 

• ICrnphasisct* qiiality. reliability, attd tnanufacturatnlity 

• i'nnide powerful, low-cost graphics capal>ilities for tetlmicfil 
workstations 

• Ifse the mature HP ('M()S14C a:3-volt 0.5-pni process 

• luse mainstream, liigh-voluinis and low-cost terfmologies 
whiU* still providing the ruH-essaiy pedornuuiee inereiises 

• Emph^LsiKe time to maiket through the appropriate leverage 
of reahires fnim jf review is f PHs. 

Meeting tJesign Goals 

We i)egmi liy U'veraging tin" superscalar processor eore 
found in the PA 7100LC' processor. Fire t we investigated 
the value *jf iiigh integration. Next we added a very large 



em betid ed primal^ cache, now feasible with the U,5-|.ini 
tcx-hnologj^H Tlien we enbaiu ed the CPU core to take advan- 
tage of the new^ on-cliip eache by reducing pij)eline stalls. 
We also ensurwl high manufacruriiig yieUls by atkUng ciiche 
redundancy. 

We found that integration supported oiu* design goals in 
miitiy (lositive ways, Becaust" the pnniar>^ cache, the siH'ond- 
ar> ^'at^hf^ controller, and the DRAM controller could be on 
the simie eliip (see Fig, 1), we had an rti>porl unity lo (h'sign 
aj\d optiiUiKe I hem together as a single suljsystem. This was 
a large factor in allowing us to achieve sttch an aggressive 
system price anrl perfoniiiuite |>oint. The lugh-iniegration 
approach also yieklerl mncii sirn|)ler systen^ design options 
for our system partners. To fiuthtM' supp<jrt these* paitners, 
we designt^d tJ\e integrated DRAM, l(*vel-2 cachis mid I/O 
bus controller wiUi extensive Cijnllgurahility (see '^ConJlgm- 
ability of the PA 73rM)LC'* on page J5j. This tH»nfigural>ihty 
enal)kHl a witle variety <jf sysiem options ranging irum 
compac^t and low-c*ost systems to much more expandable, 
ind list ri al-streiigt h systems . 

We were careful uol tc) lake a eosiTirst apta'oaeh to this 
(] esigi I . We l>e I i eve t h at | h ■ r ff m i lar ) t e i s J u s I as i m t " >r t mi t h j r- 
cuslfjinei's ot I IPs lower-cosi sysietns. We t(jok a trital sys- 
teni approai h in oi^iimixing performance while ivmphasizing 
application perfo nuance over liench marks in making design 
traflt^offs. Tlu^ highly optimized menioiy hierEUchy shows 
drajumic^ inipixivemenl IV^r thr nuMmii7-inti*usivi* pn>grtims 
ffMunl in technical aiid (commercial markeLs. 

Allot tier way of meeting our performanee goals wiis to [jiish 
the frequency while increasing the h^vel of integral rrjo- We 
focused early on the layout and floor plan of the chip to 
enable high(TTre<|Ueucy ojnTaticm, Tlirough this eflVjrt, all 
ciilieal paths were uplimiiced. We tracked and opiimixed 
02,000 indi\i(kuLl timing paths during the design phase. 

Desfnie leveraging fhi" rlesign IVoni an existhig TPr. the 
PA 7:itH)hC divsign leatn still evaluated a large array of tech- 
nka^ features and alteniatives to meet our performance 
goals. Fundamentally. ouraj>])roat^h was to binld a robust 
CPl' using a simpli*, elfieiciil inirrr^an hit et lure. Such a 
design ran less risk of fmniioiial bugs mul allowed physit*al 
designers more leeway to }jush tlieir circuits for higher 
performance. 
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Fig. 1. PA 7300LC system design. 



On- Chip Primary Cache Decisions 

It was clear fjom the beginning tiiat tiie CM0S14C process 
would allow an on-chip cache of reasonable size, so a signif- 
icant investigation was done to determine an optima] cache 
size and configuration. HP's System Performance Lab in 
Cupertino, Califonua tissisted us by repeatedly iimning 
benchmarks atad code traces with different cache topologies 
and memory latencies. 

Optimal Cache Size, Finding a balance between instruction- 
cache and data-cacfie sizes was difficult. Tlie PA 7300LC 
was intended for use in both technictil markets, where 
larger data caches are fie si red, and conunercial markets, 
where programs favor large instnictioii caches. The stan- 
dard industry benclmiarks can easily fool designers into 
using smaller instruction caches, trad hi g ibe space for more 
dat^ cache or simply keeping the caches small to increase 
the chip's frequency, HP has always designed computer 
systems to perform w^eil on large customer applications^ so 
we mcluded them m our analysis. Ultimately, we found that 
equally sized caches scaled extremely well with larger code 
and data sets. Tile typical performance degradation found 
when a program begins missing cache was mitigated by 
large cache sizes and our extremely fast memory system. 

We could physically fit 128K bytes of cache on the die, so it 
was spht into 64K bytes for the instmetion cache and 64K 
bytes for the data cache. Not only would tlus pro\ide 
impressive perfonnance. but we noted that it w^ould be 
the largest on-chip cache of any microprocessor when it 
began shipping. 

Cache Associativity, f ache iissociati\it>' was another issue. 
Recent PA-RISC implementations have used very large di- 
rectly mapped (off-chip) caches. .Associativity would reduce 
the potential for thrashing in the relatively small 54K-byte 
caches, but we w^ere w^orried about addiitg a critical timing 



]Dath to the physical d^ign^selecting the right wmfof 
associativity and multiplexing data to the cache outputs. 
Increasing the ways of associativity would further reduce 
the tlirashijig, hut make the timing even worse. The Systems 
Perfonnance Lab included associativity \i\ liieir pertbrrnance 
simulations, helping us anive at our decision ttJ implement 
tw^o-w^ay caches. To reduce the impact on timing, we elimi- 
nated cache address hashing, which had been used to re- 
duce tltrashing m dhectly mapped cache designs. Once we 
added associativity, hashing was no longer necessary. 

Associative cache designs also need an algorithm for deter- 
mining w^hich way to update on a cache fill. Again, there are 
many alternatives, but our simulations show^ed the easiest 
approach to be the best. A pointer simply toggles on each 
fill, so that the ways alternate. 

Other Cache Decisions 

Mariy other cache decisions fell out of tlie same tyt>es of 
analysis. The data cache uses a copy-back rather than a 
write- through design''''^ and a 256-bit path to the memory 
ctmt roller was included for smgle-cycle writes of copyout 
lines as shown in F\g, 2. 

Moving the caches onto the chip also siriiplified changing 
the CPL^ pipeline to remove tlie "store-tair penal tj^ in w^hich 
stores on consecutive cycles cause a hang. This made it 
easier for compilers to optimize code. 

' Way. or M-way associativity. Js a technique useiJ to view a singfe physical cache as !^ eqifaily 
sized logtcrai S4ji3cach^s Th& PA 73CI0LC caches are twtHvayassocriative. so each MK-byte 
cache has two ways cf 32K hyies each This provides iwo possible locations for any cached 
FiernorY data, reducing the thrashing that can occur in a direct-mapped cache when two 
memory references are vying forttie same locaiiiofi 

Mn a write- titrnugh cache deSFgn, data is written to both rhe cache and main memory on a 
write Irr a copy-back cache desjgn, dafa is written to the cache only, and rs writtan tcr main 
memory only when necessary. 
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Configurability of the PA 7300LC 



Rattier than choosing a smgie. mtlexibie memory and ievel-2 cache con- 
figuration, we architected the PA 73Q0LC so thai systeffl designers can 
make price and pefformancs trade-offs them^lves. Most of the choice 
available to designers are in the memoiY system 

Bus Frequencies 

The PA 7300LC QSC (gejieral system connect) bus mterface supports 
several CPU GSC frequency ratios GSC frequencies a! or near the bus 
maximum frequency of 40 MHz can be mamtamed even when the CPU 
IS running at non integer multiples of the bus frequency (e.g., 132 MHz) 

Memorv Interface 

The memory interface can be designed with either 64-bit or 128'bit 
172-bit or 144-btt with error correction) data paths. A maximum of 16 
memory banks is supported, and each hank can hold from 3M bytes to 

51 2M bytes of DRAM, The DRAM technology can be either FPM (fast 



page model or EDO (extetided data out}, with chip sizes fnrm 4M bits tD 
256M bits A brosd range of ORAM speeds is allowed, as DRAM timing 
can be software programmed using a nine-element MIOC (memory and 
I/O controller! timir^ vector 

Memory error con'f^"* ~ ^ "~'r<naL Sirvgf" ■* ' '""^^t and double-bit and 
foiir-bii borst error . nemes art - 1 with sufficient 

error laggir>g for system diagnosis and progiam data protection. 

Level '2 Gaohe 

The leveJ'2 cache js completeiy optionaJ. three types of SRAM are sup- 
ported regjster-to-register. f iow-through, and asynchronous Depending 
on the SRAM speed and CPU frequency, level-2 cache latencies of two. 
three, or four CPU cycles can be programmed into the MIOC. Parity eror 
proTEclion an the SRAM data is also opttonsL 



Adding Spare Columns to the Cache Arrays. Maiiufarturability 
is a big t-oncem for huge \'l.Si jnemniy stnirtures like the 
PA 730OLCs cacJic'!?, I.)ens*^ regiilaj- st nurtures liki^ cache 
RAM ceUs are very susceptible to the smallest mmiofactLning 
defec ts, and jiiist one failing bit out of l,200,n92 can niakc^ a 
liail useless. To coiniJeuAate. the t^aehe design temn added 
spare columns to the cache arrays. Dmins the initial wafer 
test of a CPU die. an internal built-in self-test (BlSTj routine 
iitns to check for errors. If a bad RAM cell is found, tiie BIST 
signature indicates wMcb ctjlunui should tie swapped cait, 
and a laser Ls used to blow a sjiecinl iiielal bise on the i hip. 
The I lad column is icplaced with I he s(iare. fully restoring 
tiie chips fiuicriunality. The aili<'ie on pa^e lil describes this 
featiu'e in detail. 

Integrated Memory and t/Q Controller Decisions. Incotporating 
(he riieinor>' iuid Uo <oJi(rnller tMlOt ) nnin Ihe PA 7in()L(^ 
chip was an itnpnriani ]»errnnniuice win. and we worked to 
itiakt* il even lietter on the PA 730DLC' Simply having rhe 
MU)C and t'lH ' on the same die is extremely e flic lent, 
Aji off-chip MlOV wouhl require a chip crossing for each 
data re<iuesl anfl eiritii rclinu. Chi|) crossings are liine- 
consuniing, costing many chi]) cycles at lliO MH/,. SuK'e 



tlie CPU stalls on a critical reciue^t, chip crossings directly 
degrade peifoi-mance* 

Chip crossmgs also re^iuire additional puis on i>ackages. fhi\' 
mg np the cost. As a result, designers stri\'('^ to keep external 
data paths itarrow. Wifli the MIOC on-chip, we were ai>le to 
use wider data pal Its liljerally for faster transfers. We placed 
some of t lie MIOC's hiififers uiside the cache and used wider 
data paths to create a bus tliai is or\e cache line wide for 
hiasting cache copyouts to the MH X' in one cycle. 

Cost and Performance Decisions. Despite all the iierfonnancip 
eirhancenuMiLs Ihe increase<i VPV Irecitiency plac erl a Inir- 
den on Ihe MIOC to minimize memoiy lah^iK'ies mid pipeline 
stalls because of filled request queues. Blocking for an off- 
chip resource <'osts more (TC cycles a! higher freriuencies, 
so it was i)arantount I hat the PA T-'MHJLC MK H' be fnsi mul 
efficient. The i ludlenge was in achieving this without signifi- 
cantly iurreasing the system ctJSt. 

Doubling the external nienioiy data path to 128 l>ils was a 
clear | serfomiance advantage. I>tit h Jrilso increased system 
cosh Addinf* 72 (01 data t 8 error conection) pins to the 
CPl ' die and i>ackage came at a jirice. We were concerned 
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that system designers would also be forced to create more 
expensive niemoiy dc^signs. Configurability was the best 
solution* The increased perfb nuance warranted adding piiis 
to the CPU, but tlie MIOC was designed to support a 6^-bit 
mode for less exiieasive nieniory designs in low^-cost systems. 

Off- Chip Second-Level Cache Performance, In addition to the 
primary cache, one of the PA 730(]LC's most intriguing fea- 
tares is its second-level cache (see Fig. 3). Even with the 
MIOC's very fast memory accesses, it takes at least 14 CPU 
cycles for cache miss data to be returned. Ulule this is 
excellent l>y industry standards, we had tlie opportunity 
to make it even faster by implementing an off-chip second- 
level cache. 

In many cases, the CPU' is stalled during the entire memory 
access, A typical second-level external cache design could 
drastically reduce the number of stall cycles, but would be 
expensive. Tfie engineering pros and cons were debated, 
and a very interesting solution was found. Address pins for a 
second-level cache were added to the CPU, but the second- 
level cache and DRAMs share the memory data lines (either 
64 or 1 28). Very fast FET switches are used to shield second- 
level cache accesses fiom the heavy DRAJVI Ime loads mitil it 
is deternuned that the second-level cache w^ill miss. While 
adding one cycle to memory accesses, this technique re- 
duces access time to only six cycles on a second-level cache 
hit. The second4evel cache is optional for low-cost systems 
or for those applications where a second-level cache Is not 
beneficial. 

MIOC Design EnhancefiieFits, Intenmlly, the MOC design was 
enlianced in many areas in the PA 7100LC MIOC. Tlie inter- 
nal pipeline was split into indepei^dent queues for memory 
and I/O, preventing memor>' stalls during long I/O operations. 
Reads can be promoted ahead of memoiy writes to satisfy 



CPU requests rapidly^ and graphics writes are accelerated 
aiiead of other transactions to increase graphics bandwidth. 
Finally^ the GSC (general system connect) interface was en- 
hanced to improve graphics bandwidth by well over 200% 
over the PA 71D0LC and lo support a broader range of 
CPU: GSC operating ratios. 

CPU Core Decisions 

Removing the Phase- Locked Loop. BecaiLse of its higher oper- 
ating frefjnency, the original PA 7300LC design contained 
a phase-locked loop cucuit to synthesize both CPU and 
system clocks. Desigrung a phase-locked loop in a digital 
CMOS process is challengmg and historically has affected 
yield and robustness in VLSI designs. \\Tien an inexi^ensive 
external clock part w^as found, we decided to recover the 
phase-locked loop circuit area and reduce technical risk by 
removing it. 

Integer and Data Cache Controller Enhancements, llie on-chip 
caches caused both the integer and flata cache controllers to 
be redesigned, and significajU enhancements were included 
in both. The data cache controller added a deeper store 
buffer, and by also modifying the instruction pipeline, we 
were able to eliminate completely !he siore-tail problem 
mentioned earlier. Also, memory data is bypassed directly to 
execution units before eiTor correction, with later notifica- 
tion in Oie rare event of a memory bit enor. 

The instruction cache controller expanded the htstruction 
lookaside buffer (TLAB) from one entry to four, and im- 
proved the performance of bypassing instructions directly 
firom the MIOC to the execution luiits. Botii are very tightly 
coupled to the MIOC so that memoiy transfers to and from 
the caches are extremely fast. 

' The GSC is Ihe local bus that js dasigned to pmvirie maximum banriwtdth for memofv-to 
graphics transfers. 
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Stimiimr>* 

We developed a set of gukJing prinnpies hiised ujK^n niarket- 
mg, business, and teehoic-al requirements for this system. 
The gtiidiog priiicijjles enaJ>le<1 ihe design of an exceptional 
niicroprot^essor targeted to the volume and price/perfor- 
manee requirements of the workstation and sender market, 
A large part of rhe overall succe^ of this design (*omes front 
the weU-en^ueered < aclie and memorj' hierarchy. Tlte lec"h- 
nology we chose allow^ed us to develop a high^-apacity 
primary cache and a rich set of performajice-improv' ing 
features. 

The PA Ti^OOLC design met its schedule and exceeded its 
perfonnaiu e gc>als. Customers are receiving PA 7300LC- 
based systems today. 
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FltiaUy. we express appreciarifjn to tlie Systems Performance 
Lai) for then- efforts in nuuting performance sitmilations 
on oin- behalf, A special recognition is made to Tiaii Wang 
(developer of the PA 7-KJOLC performance sintulalor) who 
passed away during this development effort. We extend our 
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Functional Design of the HP 
PA 7300LC Processor 



Microarchitecture design, with attention to optimizing specific areas of 
the CPU and memory and I/O subsystems, is key to meeting the cost and 

performance goals of a processor targeted for midrange and low-end 
computer systems. 

by Leith Johnsan and Stephen R* Vndy 



Tlie PA 7300LC microprocessor is the lalesi in a series of 
32-bit PA-RISC processors designed by Hew leii- Packard. 
Ukv its predecessor, the PA TIOOLC,!- Ifie PA imOhC 
design focused oti opt imizing price iu id perronviaiice. We 
worked toward achieving tire best perfonoaiice possible 
within the cost structures consistent w\\h midrange and 
low-t?nd systems. Tliis paper descrities the microarchitec- 
ture of the two main components of the PA 730 OLC: tlie 
CPU core imd the memqiy and I/O controller (MOC), 

CPU Core Microarchitecture Design 

Approximately one-half of the engineering effort on the 
PA 7300LC processor wtis dedicated to the design of the 
CPL' core- 'Hie CPV core includes integer execution uiuts, 
fioaling-point execution units, register Gles, a translation 
lookaside buffer (TLB), ami insl ruction and data caches. 
Fig. i shows a block diagram of the CPI ' core. 

Core Design Objectives 

The design objectives for the PA 7300LC processor were to 
provide the best possible perfonnance while c^hoosiitg the 



proper set of feat ures tJrat would enable a system cost a}> 
propriate for en try- level atul high-volume workstation prod- 
ucts. To reach this goal we integrated large primary caches 
on the processor chip and developed a t ighi coupling be- 
tween tiie CPL' core and the memory and 1/0 subsystems. 
The desigTi objectives for the PA 7300 LC are discussed m 
detail in the article on page 43. 

CPU Core Differences 

The PA 7300LC CPU core is derived from the PA 7100LC CPtJ 
design. ^^^ Although Uie PA 7300LC has many similarities 
with its predecessor, there are some key differences in tiie 
design that allow^ed us to meet our peiformance objectives. 
The fust difference is diat tlie PA 7300LC nms at 160 miz 
compared to only 100 MHz for the PA 7100LC. The most 
ob\lous difference is the large piimary instruction and data 
caches integrated directly onto the PA 7300LC cliip. The 
PA 7100LC only has a small (IK-byte) instmction caclie on 
the chip. Also, the orgaiuxation of the caches was changed 
to avoid many of The st<"ill cycles that occur on tiie PA 7100LC. 
The cache organization is discussed later in tius article. The 
PA 7300LC has a 96-entry TLB^ compared to 64 enlxies on 
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the PA 7100LC. Finally, the PA 7300LC has a four-entry 
instruction lookaside buffer (ILAB) while the PA TlOOLC's 
I LAB contains one entiy. 

Pipeline and Execution Units 

Like all liigh-perfoniiance microprocessors, the PA 7300LC 
is pipelined. TOiai is notable about the PA 7300LC pipeline 
is that it is relatively short a! six stages, while running at 
160 MHz. 

Fig. 2 shows a diagram of the PA 7300LC pipeline. It does 
not differ greatly fi'om the pipelines used in the PA 7200, 
PA 7100LC\ or PA 7100 processors. ^^"''^'"^ The foUo\^ing opera.- 
t ions are performed in each stage of the pipelme shown in 
Fig. 2: 

1. Instruction addresses are generated m the P stage of the 
pipeline. 

2. The instruction t iuhe is accessed during the F stage. 

3. The instructions fetched are distributed to the execution 
units during the first half of the I stage. During the second 
half of the 1 stage, the instructions are decoded and the 
appropriate general registers are read. 

4. The integer miits generate their results on the firat lialf 
of the B stage. Memory references, such as load and store 
instructions, also generate tiieir target address during the 
first half of the B stage. 

5. Load and store data is transferred betweeri the execution 
units and the data cache on \\\^ second half of the A stage. 

6. Ttie general registers are set on the second half of the 
R stage. 

Superscalar Processor. The PA 7;3(K)LC is a superscalar pro- 
cessor. ca|)aljlf cjf executing two instructions per tnpeline 
stage. This allows it^ ai 160 MH/,, to exe<"ure at a niaxitnum 
rate of 320 niillion justructlons per second Titis, iiowever, 
Is a peak rate that is rarely achieved on real appiications. 
The actual average value varies with the application run. 
The theoretical maximum assumes the proper mix of in- 
structions» but not every pair of instnictions can he Inindied 
together for execution in a single cycle. Fig. 3 shows which 
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pairs of instructions can be bundled for execution in a smgle 
pipeUne stage. 

Delayed Branching. Tl\c PA-RISC architecture includes delayed 

brajiching/' That is, a branch instruciion will not cause the 
program counter to change to the brarich address until after 
the foUo^^ing instruciion is fetched. Because of tiiis, branches 
predicted correctly with a simple branch prediction scheme 
execute without any pipeline stalls. The mi^jority of the re- 
maining branches execute with only a single stall (see Fig, 4 J. 
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Fig, 3, Valid superscalar iiistnietlun €oinlJiiuati<ins for PA 73(WLO. 
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Fig. 4. Branch behavior, (a) Carre cQy predicted brand i. 
Cb) Incorrectly predicted branch. 

Two Integer Execytion Units. The PA 7300LC contains two 
mteger execution luiits. Each contains an ALU (aritlimetic 
logic unit J that handles adds, subtracts, and bitwise logic 
operations. Only one imii, however, contains a shifter for 
haiiiiling the bit extract and deposit instructions defmeci in 
the PA-RISC architecture. Siiute only one adder is used to 
calculate branch targets, only one execution unit can pro- 
cess branch instmctions. This sanie unit also contains the 
logic necessary to calculate nuilification conditions/^' By 
limitmg execution to only one ];)ranch or nuLtifying instruction 
per pipeline stage, we avoided a great deal of functional com- 
plexity. Finally, only one unit contains the logic to generate 
memory adtlresses. Since the data cache is single-ported, 
there is no need to have two memory addresses generated 
per cycle. Tn special cases, however two imeger load or 
store instructions may be bundled together, provided they 
use the same douhJe-word address. As mentioned before, 
these asymmetries hetW'een the integer units prevent any 
two arbitrary integer instiTictions from bmidling togetiier. 
However, even with this limitation, compilers are able to 
take advantage of the integer supersc^iilar capabilities of 
tliePA7300LC 

Multimedia Instructions. The PA 7300LC integer miits imple- 
ment a set of Lustructions first introduced on tiie PA 7100LC 
that accelerate multimedia applications.*-^^ These mstruc- 
tions allow each intej:*cr unit to perform two 16-bit adds, 
s\ibtracts, averages, or shift-and-adds each cycle. Because 
of superscalar execution, the PA 7300LC can execute fom^ 
of these operation's per cycle for a peak rate of 640 million 
operations per second. 

Floating-Point Unit. The PA 7300LC contains one fioating-pouit 
luiit. Contairted in this unit is a floating-pohit adder and a 
floating-point multipLLer. The adder takes two cycles to cal- 
culate a single- or double-precision result. It is pipelined so 
that it can begin anew add every cycle. The multipLLer takes 
two cycles to produce a single -precision result and tliree 

' The PA-BISC an;hitecttrra enebles ceftair; instructjons lo conditionally nullify Qf cancel the 
opera ti an of the following instruction ba&ed on the resufeof thB cun^nt calculation of 



cycles for a double-precision result. It can begin a new 
single-precision mnlliply every cycle and a new double- 
precision multiply every other cycle. Divides and square 
roots stall the f 'PI' until a result is produced. It takes eight 
cycles for single-precisitjn aiid 15 cycles for double-precision 
operations. 

Instruction Cache and ILAB 

Integrating a large primary instruction cache onto the pro- 
cessor cMp broke new grotmd for PA-f^ISC nucroprocess- 
ors. In the past, our processor designs relied on large exter- 
nal primary caches. With the PA 7300LC, we felt that we 
could fmally integrate enough cache memory on the proces- 
sor chip to allow fast execution of real-world applications. 
Indeedj we have iiitegraied twice as nutch cache on-chip as 
the PA 7I0OLC used extemally in the HP 9000 Model 712/60 
workstation (i.e,, i28K bytes versus 64 K bytes). The inte- 
grated cache not only improves performance hiU also 
reduces system cost, since an external cache is no longer 
numdatory- 

Pnmary Instruction Cache, Tlie PA 7300LC primary instruction 
cache holds 64K bytes of data and has a I wo- way set associ- 
ative organizarion. A set associative cache configuration is 
difficult to achieve with an external cache, but mucli more 
practical with an integrated cache. When compared to a 
similarly sized diiectiy mapped cache, it performs bettei" 
because of higher use and fewer collisions. We chose a tw^o- 
way associative cache over other ways to save overhead 
caused by the replication of con\parators and to reduce the 
propagation delay through the way multiplexer 

Tlic primary' instruction cache is virtually indexed and physi- 
cally tagged. Because the PA-HISC arcliit.eet ure restricts 
aliasing*^- to IM-byte bournlaries, we could use a portion of 
the viiluaJ address (hi this case, t:hree Ints) to fonn the index 
used to address t he cache. To avoid usiiig viitual address 
bits would have required us either to place tiie virtual-to- 
physical translation in series with cache access (increasmg 
the cache latency) or to implement a large number of ways 
of associati\itj' (in the case of a 64K'byte cacliej this would 
have requhed a I5-way set associative organi^iation). 

Data Array Requirements. The instruction cache is composed 
of a tag array imd a data aiTay, eacli containing addresses and 
instmctions. Without using more wires or sense amplifiers 
than those found in a conventional cache organiziition, we 
organised the data arrays in ar\ unusual fashion in the pri- 
ttiaiy caches or) the PA 73001.C to meet two requirements. 

The first requirement is for the instruction cache to supply 
tv^^o instructions per cycle to the execution units. Because 
the cache is two-way set dissociative, each location, or seU 
contains instructions corresportding to two distinct physical 
addresses. Thus, for any given set (determined by the in- 
struction fetch address), there are two possible choices for 
the instructions t>eing read. Each of these choices is called a 
group (see Fig. 5a). For speed reasons, both groups are read 
from the instruction data arrays simultaneously. Logic that 
compares the physical addresses m the tag arrays ( cnie per 
group) witii the physical address being fetched from tiie data 

' Aliasing refers tQ inteniionally allowing two drfferent virtual addresses to map to the same 
physical address The PA- RISC archiiecture restricts tha number and location of hits that may 
differ hetwean two virtual addrBSses 
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array clptt*niiines which gioiip is selected and sent to the 
rest of tl^p cm Since tills is the normal iiisinu-tioii Mdi 
operation, it must Ije eoiiifiletpd in a single processor cycle. 

Tlie stH'OTid requiremeni is in be able tr) wrile eigltl instnic- 
tions simtrlUiriPously, all to (he same ^rou|j. Because a write 
or em's as part cjf the catiie miss sequeiite. it is iinportajit 
that, the write Uke only a single cycle to intemipt instniction 
fetches as little as possible. 

Fig. 5a shows the conventional motliorl of addressing data 
arrays. Because of electrical lukI layiuif cojisiderati<jns, live 
iipjier fou!' instiucliiins of t^ach eight -iiistrnc1ioji-Io[\g cache 
line aie kept ui a sepaiale anay from the lower four instruc- 
tions. Both the upper antl lower arrays are addressed and 
rt^ad concurreiilly. There are four iUTays in Inial: M^f^'ip 
Mi>per, groiij) (] lower, gr(.ju|> 1 iij>i>er, ajvd grou[) 1 lower. 
The inst rue lion fetch address sent to the instruct iou 
cache, A{idress[0:llli eojitains twelve bits. Oueatidress bit, 
Address! TO], seleci^s betweeji tJie upper and lower airays. The 
rest of !he adtlress Inis, AddressiO:9] and Address! II], go to aU 
four arrays iuid deternihie which set ( Setlxjl is read out of the 
arrays. This is accomplished with the lI-to-2048 decoders. 
In reality, loiu' rlecodere, one for each array, would be 
needed, l)ut they all connect to the same addn^ss. As dis- 
cussed al)ove, I here are I wo r^ossible pairs of insUucbnns lo 
choose from with a ^iven addiess. A sigJial fn>m logic called 
hit cotnparp selects between the two possibilldea In t-he 
example shown in Fig, 5a. hist ructions and 1 from group 
are seleeted from tlie instruction eache. 

Tliis conventional apiiioach meets om' first requinuneut. 
However, it doesu<il meet onrsetond retiuiremeul. It tminoi 
access all eight ijistiuctions as a single group sijiuiltaneously. 
This is because a cache line is located in two adjacent sets 
and only half of the hue can l>e read (or more important, 
written ) at ajiy ime time. For exainjjle, if the grruip If upper 
aiTay is suj ([dying instructi(ms t) and L it olniously cimnot 
suijply 2 iuul 3. The only way to solve this problem w^itii the 
conventional apprxjach is to split each amiy into two halves. 
This, litjwever. wosild re«[inre Ink e as rnany wires aiul possi- 
i>ly sense ruupliller>5 [)rodHciiiga si/iii)le iiu tCrLse in area cost. 
By making a slight modification lo tlie way the data arrays 
aie (jrgmiized kuid addressed, we found we could avoid iliLs 
pitfall tmd meet both of our requirements. 

C)iir addressing approach on tlie PA 7-il)0Lt ' is c ailed flfcchrr- 
boftnliiiff. Fig. 5h shows how insunctions iire t etc bed IVom 
the instrucfiort cacfie on thv PA l-iDOLC There are, again, 



four arrays: left upper, left lower» right upper, and right 
lower. The most siR^tiificai^t address lines, Addressl0:9f go to 
all lour arrays, wliile teft[ll] goes only hi the two left an ays 
aufl RightlB] goes only lo the the two right arrays. A single 
address hiU AddresshO]* selects between the tipper and lower 
airays^ as i>ef(?re. 

Wlien an instmction is fetched, both Leftllll tmd Rightfl 11 are 
set lo the value of AddfesslllJ. Because of tliis, the operation 
is viilually identical to the c:ouventional approacli descrilied 
above, except tor one key difference^: a cache hne for a 
given group is spread a<ross all four arrays, rather Ihan just 
two. This can be seen In Fig. 5b, where the instructions 
corresponding to ^rt>up 1 have lu^en shaded. Eacli anay con- 
tains pieces of cache lines from bot h grtjups in a chn-ker- 
board fashion* 

Fig, be 11 lusl rates how diet kerboarding allows simuItaneoiLS 
access to an etitire cache line. By setthig Leftllll to the group 
desired and Rightful to the opposite viilue. all eight instruc- 
tions Iron] OLie group can he read or written. In the example 
shown hi the figure, au entire cache lijie fiom group 1 is read 
oub Leftlll] is set high, while Right[l H is set low. AddressfO:9l 
selects which pairof sets^ Setlxl and Setlxtl], are accessed, 
I'^ig. t) lists the lesuhs of addressing the airays with the 
various comlnnatit>jLs of values on Left[ltl aiid Rightllll, 

Instryction Cache Hit Stages. 'Hie CPI ■ < ore will atiemjjt to 
fetch a pair of insl!u<ii(>ns from the instnictioii cache every 
cycle during which it is noi stalled. For example: 

• the iiistruclioii fetch address arrives at the instmction 
cache at the eml of the P .stage of the pijieline. 

• On (he first half of the F stage, the word line (iecodcrs fire 
one word iint^ to each array. 

• On the second lialf of the F stage, the array is read^ dri\ing 
its value onto the bit hues to the sense amplifiers. Tlie way 
multiplexer thtni selects the proper pair of instructiotis from 
t he sei I se ai n p 1 i 11 er o u t p u Is. 

• On the first half of the I stage, the instructions aie chiven to 
the executlou units for decofhng and execution, 

hstryDtion Cache Miss Stages, In the case of ait instruction 
cache miss, which is known by the end of the F stage of the 
pipeline, the entire piiieliue will slalf A read request for an 
entire ( acht^ line will tlien lie sent to tJie memojy controller 
'fhis request is called a copy hi request. A t)4-bit data path 
between the memory controller and the mstruction caclie 
requires a minimum of foui* cycles to transfer the entire 
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Timing Flexibility 



Microprocessor design is a time-consuming amJ expensi^ process 
Ideally, a design shoutd scale through several fabfication process genera- 
tEons wrth low- investment algorithmic artwork shnjnk to help amomze 
! of the original design 



jre 
or less riKed Typically ttte t)ase processor design nas the capability for 
a range of core-processor'freqtJBncY'to-rnierconneci-frequencv ratios 

The PA 730OLC has three interfaces that are tijteran! of increases m the 
processor frequency the I/O bus interface, the mam memory mterface, 
and the second-level cache interface 

The cycle time of the general system connect jGSC) I/O bus can be con^ 
ftgyred to some multiple of the processor's cycle time The f/0 control let 
supports ratios from three to nine The second-level cache cantralleJ" can 
be configured to support a vanable number of CPU cycles per second- 
level cache cycle. The controller supports two, three, or four CPU cycles 
per cache cycle. Similarly, the mam memory cpntrofler can configure Tlie 



setup md hold times of itie DRAMs to be two. three, or four CTJ cycles 
Additionally, seven key DRAM timing parameters can be tndfvj dually 
programmed 

As the ptDcessof §ets faster, performance may imprBve but only as a 
K • mem Dry and t/0 perfor* 

rri :i:hes an tfte PA 7300iC 

help insulate tne processor trom ine effects of The nelattvely slow nr^m- 
ory accesses, alltjwmg the performance to scale wail with increasing 
core pmcessor frequency The initial frequency target for the PA 7300IX 
was 132 MHz, but design ratios support core processor frequencies up 
to 360 MHz 

Two additional benefits are defived from the timing flexibility of the 
PA 730OLC The mcreasing availability of higher-speed DRAMs and 
SHAMs makes it a simple matter to cor^figure the timing generators 
to take advantage of these new componems Also, timing flexibility 
decouples the design effort from uncerramties thai develop as RAM 
component vendors traverse their own development cycles. 



carhe liiuMd the instnictjon raclie. Four ryrU's art* rpriuirpcl 
iH^eaiise I lie meni<>i>' rontrolkT am only deliver ri4 Ints per 
eycle mul a carhe line contains 2^^i\ Ijils. Tlte luenioi-}, ton- 
troUer will ret tint the i^air of m^niicikms originally intoiuled 
to be feti^hed fii^tT regardless of the pair*s position within 
tiie taihe line. As each pair of instnictiors is retimied from 
nuMnory, it is written into a write liuffer. Tlu* instnictituis tan 
lie fete lied ciircH'tly IVoni I his hufff^r hefure they are wrilten 
to the cache, with the Ilrsl i>airs anival causing the pij)eline 
to resume execntion. Tliis capaljility is cnnittionly refened 
to as si tram hi (/. In el'feci, the write l>tilTer Inrnis a third way 
of iLSsofiat ivily. After the hist pair of instrHcliuits airivt^ from 
nvenu.>i'>'\ the wiite Imffi i contents an* writ ten to thr eariie 
in inw (^vcle. 

Unified Trartslatton Lookup Table. Since tlu* instrnctidn leteh 
addit'ss is a vlitiiai ad<irt\ss, it nmst l>e tiiapt»ed into a corre- 
s|>£)U(tiii^ physical address at the same titne the insUut tion 
caelie mraysare being accessed. Nonnally, a full instiiu ti<m 
tcnislation lookaside buffer, or ITIJi is ustKi to perform this 
function. On tlie PA 7300LC, els on all recent PA-RISC' ijrores- 
sors, we fell that tlie {MTfonnanfe tintax>vememsafhjeve(i 
with ;l separate ITLB and T>TbB (for data aeresst^s) did nul 
Wiirrmtl till* increased chip area cosi.s. insiead, we opted 
for a imified TLB thai perfomi** both instruction and data 
traiLslations. 

Instruction lookaside Buffer IILAB). Ikn ause hnlh an instntc- 
lioii ;]nd a dat^i trans tat inn art' rer|uinH| on tnany cycles, a 
smaller si met lire called an inslntetioii lookasid*' hnfler. or 
lLi\B, is used to translate ijistntction addresses, while the 
larger unified TLB is free to transtate data addrt\ss<*s. The 
fiatrenlr^y' fLAB is a subset of the unified TLB and contains 
the most recently used Iranslations. This siriife*jy isipiite 
elTeetivi^ btn-aitse insirm liiin addresses, unlike data ad- 
dresses, lentl to be hij^hly correlated in space in that they 
generally access the sante page, a previous t)age. or ihe next 
page. 

When an instnuMion address does miss the 1L.\B. nonnally 
bccaust' of a hriijuh. tlie t)lpefine will stall to transft'r tin* 



desired tnmstation from the unified TI.B to the JLAB. We 
designed hi two features to tuitigate these I1^4B stalls. On 
hriinch instruclirms that are not Ivnndled with amemoiy 
acx-ess instruction (sucli as a load or store ), die unified TLB 
will lie accessed iji parallei with the ILAB, in anticipation of 
an ILAB miss. If die lUA^B imsses, the noniuil [L\IJ stall pen- 
alty will be reduced. The second feature wv added was IIAB 
prefetrhing. Every time the C PL begins exeenting iHi a new 
inslrurtion page, the TLB mil take the opi>oriimity \u trans- 
fer the tnmslation for the next trage int*) the UAB, This can 
eompletely avoid the ILAB misses associaterl with sequential 
code executirjn. 

Data t ache and TLB 

We flcsi^n^'d the data cache array to l><* very similar to the 

insliueiioti eaehe arrays. Like the inst[iicti(m eaebe, the data 

eachi* is iwi*-way set associative, viilually indexed^ atul 

physically tasked. It Is (*omposed of three arrays: 

A data ariay, w^luch has the same checkeibtwud organization 

as the iiistineiion carhe data array 

A tag array, which is alinosl idenii<'al to its instrnt tion 

eacii e t 'o n n t e q >art 

A dirty bit an ay, which hai; no t^ouriteipart in the instruction 

c ache. Tiiis array keeps trat^k of whether a data cache line 

has been lurjdilli'd by thc^ inshin lion sireajii. 

Altliongb oigaiiized in ;i way similai" tothe iiTStmcIiun eaehe, 
I he data cache's intenuiJ operation mid effect on the t'Pl" 
jiiljeline are tjuite different. Ttie data cache and TLB oi)erate 
in the A aiKl B stages of the piipeline. A load instruct ion 
causes a data address to lie ^eneratetl in tlie first half of the 
B Slage, The data i arhe word line <iecodei>; operate on the 
sectnui luUf of the B stage. On the first hdf of the A stag*', 
the (irrays drive their vidues out. Baserl on the comparison 
between the phy steal a fid less and the oinpnt of the tag 
arrays, the way mulliplextu- tlK^n selec is the proper data 
value. This word or double- word value is ilit^n driven to ilu^ 
integei and Iloating-point units during iheseeond half oflbe 
A sfage. 
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Fig. 7. 'I1i«' srorc cjiiciit'- 

A store instniction generates a data address in tlie same 
maimer as a load instruction. Thai address is used to read 
Ixom the tag array as descriljeci above, histead of using the 
store acidress to read tViJiu tlie data ;ur;4y, htm t'ver, ihe ad- 
dress from the head of ;i fwo-eiitj^ sloiv fjiieiie (F\g. 7) is 
us(Hi to mdex into \iw data airay on tlie second hall o( the 
B stage, Tlie data from the liead of the store queue b written 
into the data aiiay o!i tl^e fust half of the A stage. The datii 
from the store inslriKiion is driven from tlie inleger <jr 
noathig-[)oiiu umts to Ihe data tatlu' on tlie stn-ond half of 
Ihe A stage wliere it is written hitcj Uvt* tail of die store i|iieue. 

Store Queue, A load can retrieve data directly out of the store 
queue if it is to the same address as the store. The necessity 
of 1lie store quetie is twofold: 

• The flfjatingixjini unil CLU^iiot drive store data in time to 
write the data ajiay during (he [Jioper f>ipeline stage. The 
store queue, therefore, pro\1dt*s the tin\e to trajisfer the 
data from the execution imits to tlie data cache. 

• Menioi~>' carmo! be modified until it is kiTown dial the store 
instniction will pro]jerly finish execution, [f the store in- 
si ruction is going to trap, say, because of a TLB fault, any 
aichitected state, such as memory, moat n(Jt he change<f 

Tile disposition of ihe store instruction is not known until 
the H stage of the pipeluie, well after the data airay is to be 
written. The store queue selves as a temporaiy l>nlTer t(j 
hold jjemhng store data. If a sf (jre that writes into Ihe .store 
queue sut>seqnentiy traps, thai store queue entiy is merely 
invalidated. Also, by using a store queue, we ai^e al^le to use 
a smgle biduectional luis tcj rraiLsfer data between tiie exe- 
cution units and rhc data cache. The store queue alh>ws data 
to be transferred on the second half of tiic A pipeline stage 
for both load aiid store instructions, preventing conflicts 
between acyaccnt loads and stores m the mstiiictioii stream. 

Semaphore Instructions. The fiat a caciie performs other menv 
oiy oj.>erations bt^sides load and store instructions. It handles 
semajjiiore instnictions, which in the PA-RISC* architecture 
require a memoiy locaiioit to be read while that location is 
sinuiltaneously zeroed. In operation, a seniaphon* is quite 
similar to a store mstiTiction with zeroeft data, excei>t tliat 
the semaphore read data is transferred on the second half of 
the A stage. In cases m which tlie sema|jliore is utit present 
or modified m the tiata cache, tlic ioad and clear operation 
must be perfonned by the memor\' controller. 

Fiijsh and Purge Instructions. We must also execute flush iu- 
sirnrtions, v\hieli cause a given niemon- locati<ai lo be Ccisr 



out of die data cache. Related is the purge mstruction, wliich 
at 11 u^ uiost pri\ile^i*d levtvl causers a jnenu)!^ location to be 
invalidated in tlie data cache with rio cast otit. even if the 
line is rnoilified. 

Reducing Miss Latency. I lata ca< fie misses aie detected on 
the rir.st iujjt Df ihe A .stage tjf tlie pipeline. To reduce ini.ss 
latency, the physlc^U a<1dress being read from the data (ache 
is forwarded lo the jiu-mory ctjiir roller before tin* data cache 
hil-or-niiss dist^jsition is kJUiwn. Tins atldress is drivei^ to 
the memory controller on tlie fimi half of the A stage. A "use 
address'' sigtiai is dnven to I lie memory controller on tiie 
first lialf of the R stage if a cache miss occm's. 

Gopyin Transaction, A number of trajisaction types aie sup- 
ported between tlie CPU core and the mem 017 controller. 
The most common tyj)e is a copyiii transaction. 

After receiving a copy in requesl, ttie memory contr<jllei' re- 
tuiTLs the rt^quesled cache line, orie double word at a time, 
.^s w ith instniction misses, Ihe memory controller returns 
the data double w ord that was originally Intended to be 
fetched first. 

On kiad misses, when the crillcal double word m rives, it Is 
sent directly lo the execution units for post hig into the regis- 
ter files. t>n integer toad misses, the ciitical data is by]>assi'd 
before error connection to reduce latency even fuitlu^]- 

In the extremely rare event that the data contains an error. 
tJie CPV is prevented from us i tig the bad data luid forced to 
wait for coireeted data. As each double word anives from 
I lie nienioiy controller, it is placed into a copy in buffer. 

When all tln' data luis arrivedn the conleuls of Ihe copyin 
Liuffer are wrilten to live data cache <lara array in a single 
cycle. Tliere are actually two coi>yjn buffers to ensure that 
two data cache misses can be handled simultaneously. 

Fig. 8 shows a block diagram of Ihe copy in and co]>yont 
buffeis, 

Copyout Transaction, A data caclie hue can contain modified 
data requiring iiosting or wilting back lo memoi> when cast 
out. 1Y> litis end, anolber Iransaction tyt^e is implemenied — 
a coi)yout trtmsactkni. A ctjpyoui is necessary under twt> 
circumstances. The fii^t case is when a data cache miss is 
detet ted and the existing caclie line selected for replacement 
has beeji motlitled. This is the most common case. 

The second ctase is when a fiush instnietion is executed LUid 
hils a modi lied line in the data cache. Tlie data cache sup- 
plies both a physical address autt 32 Ijytes of < lata on a copy- 
out The data cache uses the checkerboaixl organization, so 
the fidl i^ache line lead for the co|)yout takes only one cycle. 

Reducing Cache Miss Penalties, hi the PA 7;J0tlU\ we have 
taken a nun 1 tier of steps to reduce the penalty caused by 
cache misses. As menlioned afjove, we have reduced cache 
miss latencies. We have also couiiiuLed t<j atiopt a "sialhon- 
use'' load miss policy pioneered on ear'Uer PA-RISC designs.^ 
hi tills policy a load miss stalls tlie CPI' pipe hue only long 
enough to issue tiit^ cop.\in transaction and possibly a copy- 
out nmisaction. In many cases, the delay Uisis for only one 
cycle. The CPl^ will tlien only stall when the target register 
of the load instiuction is sul>sequently referenced. If the 
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t Tjtical data retiuns from meinoi^ fast, enough, the pipeline 
will not stall at all. 

Becaust^ memory data is not noeried by the CPl ^ un a store 
miss, the CPU only stalls once, again for only one eyele in 
majiy cases, to issue the copyin and copy out transactions. 

A si'orelioartl keeiJ.s i rai^k of which words have been storefl 
so tliat the copyin write will not overwrite more recent data. 
Since high-l>andwidth writes to i/0 space can be critical 
to graphics performance, under most circimistanees tlie 
PA 7300LC will not si all on a store to 1/0 space. This opti- 
mization is possible because an 1/0 space access is guaran- 
teed to miss tlie data cache, so there is no need to stall the 
CPU to perform a crjpyotit read. 

Cache Hints. Ttie PA- RISC' architecture defines t^che hints to 
allow^ the jirognttnmer or compiler tcj commimicate infomia- 
tirm that can l>e used by the hardware to imjjrovc perft>r- 
inance/^ We have implemented two of these hints r>n tlu^ 
PA 730f)LC: 
• likick copy store. Hints are used to ind irate Uiat software 
ijUends to write an entire cache line. In this cjusc, there is 
no iier^d to perfonn a main niemoi^^ rc^ad oti a cache miss. 
Witli this hint specified, upon detecting a store mi^, the 
PA 7:i()(JLr simply zeros out a c{j|>yin buffer and continues 
without is.suing a copyin tnyisaciiort 



Fig. 8. A blof'k diagram of the 
(^opyin and tiopyoui buffers. 

• Coherent operation semaphore hint. This optimization im- 
pro%'es semaphore rierformance by not forcing the loail and 
cleiu^ operation to the inemor>^ unit if the data is present in 
the cache. 

TLB Access. A\\ memoiy reference instructions are guaranteed 
access to the unified TLB containing both instniction and 
data translations, during the B and A stages of the pipeline. 
TIk' TLB is firily associative and contains 9ij page transla- 
tions. Ttie TLB receives a \irt ual data address on the first 
half of the B stage and drives a translated physical address 
on the first half of tfie A stage. Tiris ijhysjcid address goes to 
the data cache to periorm hit conipimson ;iiu[ to t]ie menior>^ 
controller in anticipation of a data cache miss. 

In addition tn containing 96 page entries, each of which 
maps to a 4K'byte page, the TLB also contains eiglit block 
enf ries used to map larger memory ranges. These block 
entries aie nianaged by the operating system. 

CPU Summary 

.'\lt hough the C.^PU core of the PA 7:iOfJLC is not dramalically 
different from its predecessors, several noteworthy featin-es 
that imi>rnvc |HTfonnanceand aUow more cost-effective 
systeui designs incluiie: 



.hmv I mi lie wlHt - F*arkiif ( f Jn n ma! 5 5 



)Copr. 1949-1998 Hewlett-Packard Co. 



• A simple pipf Jine and a capable siiperst.mlar rorL^ that 
incrp;iKe(l om o]n'nil\n^ frpqiunK y to 160 MHz. 

• Suhstmiiial priiiijuy cat 1k»s inlegiaicd dircrtly onlcj iho 
f^HK t^ssDirhip 

• Mosl iiiiijcjitanr, tadic tujiiKjiJcis that tak(* advmitage of 
integratf^fl caelies, resulting in featuresi fjpsigiu^fi U\\o the 
CW tore to iiK'reasiMlu^ conijiPtitivenesK of r*A 7;^f)flJ.C- 
bast'd syslems. 

Memory and I/O Controller Design 

The iTiernory and I/O controller (MTOC^) is responsible fnr 
interfacing the CPII core to the niemoiT and 1/0 suljsyslenis. 
Integraliiig !h(^ M10(" on I he smwv chip ;ls tjie CPU core pjo- 
vides a tiglu coupling that jcsidts in out standing memory 
aiitl I/O performance. The memory ctjntro Uer includes a 
main meniorj^ controller and a controller for an optional 
seeond'level cache. The E/O controller interfaces (he CPIT 
core to Hi*s general system connect ( (fSCj I/O bus and hmi- 
clles direct niemoiy acc-ess (DMA j requests from 1/0 devices. 

CPU to Mice iDterface 

The CPU core tr;nismtts four basic ty^ies of refinest to the 
MIOC: 

• Copyins, These requests occur dining first'level cache 
misses mid mv userl by the C;PU core to read a cache liite 
from the menu ay snbsysienr 

• Copy outs. A copy out is a cache lijie from the ('PI i core that 
must be written to the memory subsysteju Liecause it was 
modified in the first-level cache by a store instruction. 
Co]>youts are only issueci when a mot I i Tied cache line is 
replaced ur flushes 1 from the first -level cache. 

• IJncacbed loads mid stores. An nncaebed load orsfore 
request Is a read or write to either nietnor>' or I/O for an 
anioimt of t la! a I hat Is less than a cache line. 

• Load-and-clears. This request is iui indivisible request to 
read a location and (hert clear it. Tliis operation is needed 
to implement PA-RISC s semaphore mechauism. Requests 
that have addresses located in mernor>' address space me 
processed by the memory' C(aii roller, and all others me sent 
to the I/O controller. 

Tlie PA 7300LC has a four-entiy copyout buffei-. ("opyouts 
are posted t<j memoi-y as a backgrf Jund operation, allowing 
€0|>yins to be processed before eopyottts. New copyin re- 
quests are checked for eonfliet within the eopycjut buffer. 
If I here is no conflict, the copyin is processed before all 
coijytjuts to help minimize load use stalls. 

See and -Level Cache Control 

Even Ihnngh first-level ( athes on the PA 7;3O0LC are lela- 
ti\ ely large for integrated caches, many a|)plicatirjns liave 
data sets that are too big to fit into ihenr The second-level 
cache (SLC) implemented for the PA 7.J00LC helps solve this 
problem. Logically, die SLC appears as a liigli-sjjeed memory 
buffer; other than its t performance improiemcni , i( is trans- 
parent to softw^aie. Tlie .SbC is physically indexed, is write- 
Ibrough, has unified instructions and data, and is direct 
mappetL 

The SLC becomes active after an access misses the first- 
level cache. The fii'st-level cache miss indication becomes 
available after the TLB delivers the real address. As a result* 
there is little advantage to virtually mdexing (lie SLC and 



real indexing avoirts the aliasing problems associated with 
virtual ea<^fies. 

MuFttway Associative Cache Comparison. .Multi way associative 
catiies eryoy hctier liil rales Ik cause of fewej (^nlHsicjns. 
However, imdtiway caches art^ slower Iuhhusv of way selec- 
tion, and for a given cache hvav, are much mtn e expensive 
lo implement with indnstTy-stajtdard components. For most 
aiiplications, it is more adviuitageons to trade off .si/e for 
w ay s o f ass o< i a ! i v i t y . 

Write-Back Cache Comparison. Writ el jack caches^ generally 
have better |jerformance than wrile-ihrough eaclies, llow^- 
ever. shaiiiig the data bus with main memory allei s this situ- 
ation, If the SLC were write-biit^k. lines copied out of the 
SLC would bav(^ to be read huo the PA 7:^f)0LC. the error- 
(orreclingcode (ECC) would have lo l>e computed, ;ind (he 
lini^ would have to be written hack to ni<un memory, riiis 
operaticm would be quite exjjensive in terms of iais ]>aad- 
width. Instead, dirty lines east out by the nrst-level caciie 
aie written to the SLC and to jnain memory simultaneously. 

Any valid litK^ in the SLC always has the same data as its 
corresponding lofalion in main memoiy Writing simulta- 
neously lo uiain memory' and (o the SLC is shghtly slower 
thmi simply writing to I be SlX^ SRML but produces a g(jod 
Iieribrjiiance and complexity trade-off when t^ompared to a 
write-back design. 

DMA Interface. DMA reads and writes fujm I/O devices are 
iy|]ically scqur'iilial and (iu no! (^xlnbil the access locality 
pal terns lypical of CPC traffic-. Entering DMA tiaffie into I fie 
SLC leiuis lo ])ollute the SLC with ijieffecti\'e entries. Insiead, 
bufferini^ mui i>refetcbi[ij^ inside the D^L4 intetface aie better 
w^ays of irr^proving DMA perfonnance. To maintain consis- 
tency, an SLC check t ycle is rini for f)MA w- rites, and if it 
hits, the line is marked invalid. DMA w^rite data is always 
written to main memoiy and DMA reads are always satisfied 
from main memoiy Because of tlie W' rite-through design of 
the SLC described above, data in I he SLC never becomes 
stale. 

SRAM Components. The PA Z'^OOLC is optimij^cfl lor both 
price and performance. Relatively early in I he design pro- 
cess, it became necessajy to select the slattc Ffindoiu access 
memory (SRAM) comfjonejUs usefj to build the SLC. SRAM 
cruut^ouents are freiiuently nsed in cache cr>ustntctitm be- 
ciiiiise they offer high sjieed with moderate crjsl au<i eapaci- 
Ties. Given the relatively long design cycles necessary to 
produce a complex tnicrot^roeessor mid the nneertainties of 
the setuiconductor marker|flaci\ it was rm|>ossible id predict 
whicfi t omponenls would be most attractive from a )jrice 
anfl ijerforniance peispective when the PA 73UULC entered 
full j>rofiuct ion. Instead of selectmg a single component, the 
decision was made to siipt>ort a bioad ranj^e of SR.'XM t^ses. 
Tlris iillowed com[(oneiU seleclion to ])e nuide late in the 
din elopment cycle and even upgraded at some pomt during 
I he prodsa lion phase. 

Second-Level Cache Size, Most popular computer benclmiark 
programs have relatively small working sets and aze not 
particularly sensitive to the performance of the memoriy' 
system beyond the fusidevel cache. ( )n the other haiicL 

' In a write- back cache design Istso called eopv-tiackj. dai^ is wrinen pttIv to tlie <:ache on 
5 write, and i? nut WTittsn to main mfifrrorv until the cache line fs rnvflfidated Jn a vvtrte- 
ttinii^gh cache design^ data is wmm to both ihetacte and mam Fnefnory m a wnte 
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applicalian programs liave widely variable working sex 
sizes. Some arc small and fH vveU in the first-level cache 
iiiid some exhihn Utile reference locality aiid dnn'i fil in any 
reasonably siztil i-achc. Hetice. no single HLV .sixe is apprci- 
priale. The PA I'-MILV SU' contrtjUer supports catlte sizes 
ranging from 25(jK bjites up to 64M bytes, Althougli a MM- 
byte SLV is exisensive* it might be cost-effective for some 
applii'aiions- 

Tlie SLC daJta an-ay viidth can be programmetl to either 54 or 
128 bits plus optional EC(\ However, the width niiAst match 
the width * if main memoiy. 

Memory Arrays. The SLC consists of two memory- arrays; the 
dat^ array and tlie lag an^y. The data array shares the data 
Jnis with main menioiy. As an option, ECC bits can be addeci 
Id I he (lata ai-ray. «md the full sinjile-lnt correii and double* 
hit detect en'or control invdketl for SLC readsi. Tbt* Uig amiy 
juiindes a single optional parit>' bit. If parity is enaljled and 
l>ad parity is detected on a tag access, an SLC miss is sjg- 
milvil the failing lag and address are loggtHl, and a machijie 
(heck is i^ignalcd. 

Main Memory Control 

DRAMs I )ynatnii' random access memory (DRAM) teehnol- 
ii^j^' i!s used to cimstmct main ntemoiy because of its high 
di-nsity low cost, and reiisonal>le pertomiaiicc levels. The 
main niemoiy contjoiler sup[*orts indusny^-staji<l;ird iJRMls 
from 4M-bit to 256M-bit capacities. Systems can have up to 
ll:i slots and total nicmoty^ c;m be up to "1750 bytes, the 
maximmii possil)ie with the PA-RISC 1,1 architecture. 

Data Bus Width. iJata Ims widfli can lie tither iv| r^r liS liits 
pltis ()plu)ri:il Etc. Tlie 128-bit diila hn.s wifilh signiiiriiiitly 
intproves memory performance. The ()4-bit aplion sttijfjorts 
h )\v(T-cost systems* 



Main Memory Controller. Tlie PA T^JIMU/' main menior>" con- 
trol ler Ls \i'n llrxilile ;ind is able to suptKirt most tyjies of 
asynchronous fiRAMs. The controller is mtentionally not 
SIXLM'DLMM (single or double inline memory module ^ spe- 
cific. Tills alltHVS use of the PA T^ttMlLC in a wide variety of 
system conOgyrations. The main memory^ cim support ex- 
tended data otit (EDCJ) DRAMs, which aie simitar to other 
DRAiis but ust^ a slightly modifted protot ol ihai pijielines 
tlie c^jlunm access. 

Pig. !* shows the tiiinng diagraiiis ot rea<l acci^ss^^. enipliaisi^E- 
ing the improved data liaiiclw idth of ED* ) DR.\.Ms tiimparetl 
to standard page-mode DRAMs. 

Error-Correcting Cade. The state of DRAM memor>^ ceils is 
susrt^I>tilile tij I. uniiption from incident energetic atomic 
panicles. Because of this, (he PA 730()LC main tnemoiy con- 
troller optionally generates and checks mi erroiH-orreciing 
code. Tlte c^tjde is general ed over a ri4-bil daia word. Any 
single-li*it error witliin the (i44>it data wfird e;ui lie c*>rreeted. 
AH double-bit enors and all thrtn^- or lom-bit enors witliin an 
jiligned iiilible can l>e detectetL Tlie iiligned nibble capability 
is useful siiirf iticmory systems are l.vpic ally built with fotir- 
bit-wide L)H/\Ms. The nibble mode capal)ility allows detet tion 
of the catastrophic failure of a sii^gle fotir-bit-wide DRAM. 
Whenever an error is detect eiL data and address logging 
registers are acl hated to supp<jrl elTiciem fault is(jiation and 
rapid field repair. 

Shared SLC^ and Main Memory Data Bus 

From a cost pt^i-spi^ctive, it w;is desiral>le to share tiie Im-ge 
data buses nei-ded for the SLC mul main memory, (liereby 
lowering tlie i)in comil; of the PA T^DdLC. However, sharing 
the large h>ad from main nienKJiy IJRMI nmU would ha\e 
significantly iini*actetl the speed of SLC operations. The 
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Fig. 10. PA 73mA' block diagram shewing the position of the FET sv^ltch. 



solution to this probk^ni resulted in using an FET switch to 
isolate the main nn^inory load from the SLC bus when the 
SLC is drmng the I jus. l)Lit to ^itlow the has to be shared 
when main menioiy is being accessed (see Fig. lOJ. The PET 
switch is a relatively inexpensive industry-stand ai'd paii , 
which has a propagation delay of less than 1 ns when in the 
on state, 

FET Switch. The FET switch also enabled us to connect, the 
PA 7:K)0LC to legacy 5-volt DFiAM cards. Tiie PA 7300LC 
operates at 3.-.^ volts and is not tolerant of the 5-volt logic 
swmtg of many existuig DRAM cards. Biasing the gate of the 
FET switch to a voltage lower than 5 voits effectively hmits 
the ^'Oltage swing from D1L\]VI cards to 3.3 volts wlien seen 
by!hePA7300LC. 

Chip Layout Challenges 

Although the MIOC is a small r^art of the PA 7300LC, it con- 
trols nearly all of the 1/0 puis. Because the pins are located 
at the chip perimeter, long signal routes from the MIOC to 
some puis are unavoidable, Sei^arating tlie MIOC mto several 
blocks that could be placed near tlie chip perimeter and con- 
trolled remotely helpeci manage this problem. In particular. 
tJie data flow across tiie shared SLC and main ntenior>' tlata 
bus Ls completely predictable (because there are no slave 
handshakes from the memories), making the memoiy^ data 
interface the ideal block to be controlled from tlie other side 
of the chip. 

Cache Miss Data Flow 

The MIOC is highly optimized for satisfying CPU cache 
misses, .^though DMA transaction processing is handled 
efficiently, system performance is more sensitive to CPU 
cache mi^ performance than DMA perfonnance. 



When idle, the SLC and main niemoiy controllers pass 
through physical ad<lresses that are cojiiing directly front 
the TLB and going to the SLC and main memory address 
pads. On the cycle following each address, the CPU core 
indicates w^hether that address restdted in a miss m the first- 
level cache, if a miss occurred, then an access is initiated 
ami a cycle is saved by having passetl along the physical 
addresses to the SLC and main memory. 

For copyins, the SLC begins aji access. The tag and data 
an ay are accessed in parallel If there is an SLC hit, then 
data Is letttnied to the processor. 

On an SLC miss, the SLC data array data drivei^ are disabledt 
the FET switch is closed, and control is transferred to the 
mait\ memory controller 

When a transaction is received by the main memory contr^- 
ler, it endeavors to activate the correct DRAil page. This may 
be as simple as isstiing a row address strobe (HAS) with the 
proper row addiess, or may reqttire deasserting RAS, [ire- 
charge, and a new RAS. Tlie memory controller sequences up 
10 the point at which it is ready to issue a cohinm address 
strolie (CAS) command, waits there until the SLC misses, iind 
switches control over to complete the CAS comniiUicf How^- 
ever, if tJie SLC hits, it \vdll wait for the next transaction and 
start the cycle again- Performance is impro\fed by stating 
the DRAM access in parallel with tlie SLC access. 

In the case of rni SLC miss, once the main memory controller 
has control it Issues the proper numl>er of CAS cycles to 
read the data. As the data passes the SLC. it is latched mto 
the SLC data array At the end of the cycle, the FET switch is 
opened, il\e SLC rlri\^ers aj'e enabled, and tiie next iraiLsaction 
is processed. 
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Reducing Low-Mbs Utencies. Much of I he work ties<'rttied 
aliuvf t (iiuvras rccliu ijig juLss latitudes. This is imi.>ortatil 
because even though the PA 7-M\W CPU core hasa non- 
blocking catiie, load iLse stails *iliIJ fIe\x4op qiiiekly for niafiy 
inslniction sequences. Ixjw-uiiss latencies minimize the 
impact of these j^talLs, wJut'h resuJtH l^n herter oxeraJl perfor- 
miiii(-e. Ai (TF clm^k ratt^ of UK) MHz. the PA 73tWU\ as 
seen tjy the VPV pipeline, is ea|>al*le of 8LC hit latencies 
i>f iJiree c>rlc*s wiih iiuiusir> -suiufiard (Vns tLs\Tirhronous 
SRAI^i Main menuint^ latencies can l>e as low ils 1:{ cycles 
with r>0-ns DRAM, Many single-t^ycie latency ri^ductions 
have heen iniplenienteti in Ihc VA 7:J00L(.'; ei^ch by itself 
would nor [ia\ e mucfi im]jact on overall nien*or>^ access la- 
tency, but taken togetlter they make a significant difTerence. 

I/O Interface 

The PA T:M\LV cf »ntaijis interfac*^ logic that allows direct 
c*mnectiou to HP's liigh-spe*'d GSf I/O liiis. Tliis interface 
processes VO requests iioni llie CPF core and DMA rc*- 
quests frtmi GSC I/D Ims devices, 

Prosrammed I/O. Pragnminied I/O iillows load ainl store in- 
si ructions from the CPU core to comnmnicate witli the I/O 
siitisystem. From a peiformance pei^spectixe, prfjgrajmned 
I/f ) writes lo graphics devices ate imr>ortant lor many work- 
station ajJi.ilications. The improvements made for graphics 
pt^rfonnance in the PA 73(K)LC' are described later in this 
ailu-le. 

DMA Interface ControlFer. The DMA interface conti^oller is 
di^signed \o minimize niaiji memoiy controller traffic and 
to reduce DMA read latency. The DMA interface controller 
ettiploys three 32'byte line hufft^i's. Wieii semcinj? tmy DMA 
read, the controller reqnests '32 bytes from main nttnnnrj 
ajtd (vnts the data into one of the htiffers. DMA re<inesis (Jii 
the (JSC bus tUriy be 4. «S> 1(1. or '12 hyies Utng. Since most 
t)MA iet|nesls are to si^mtential adthesses, ret^tiests It^ss 
thaii *12 b\ies can proljably be satisfied front data contained 
in tlie buffer without issuing imotlier n^iuest to the main 
tnetnoty controlhT. Thi' DMA <'(»nlrolltn' is ;il.so able to \>vv- 
fetch the next sequential Utte (»r infortviatitni ttj incrt^jLse the 
ch^mces that DMA trail tequeslsat*e setTieed horn tiie DMA 
biilTeis. 

GSC Write Re{|uests. Writes are collected by the DMA hard- 
wsui- and piussfd on to the main lueintirj^ cottt roller. CSC 
write requests of 32 bytes are S4*nt directly to the enntrr^ller, 
1m It witcn possible, siuulb't-si/.(^<l wiiles are </nllectrd into 
32-byte chunks by the DMA ixintroher to allow the tnain 
tnenu>ry ctjtu roller to aceess memory more efHeienlly. 

Improvements for (iraj>hies Applications 

riraphi<*s t>erforj nance depends on many ast>ects of the 
system design. In addition, graphics w^orkloads are sensitive 
t(? the systetti architecline. For the PA 7:iOttL(\ w(* chose to 
opliiut/,e the design lor t/rtginec'ring gra|)hics. where the tvin- 
eal wrirklcjad invulves rendi'ring nu ulijec! Iri I hi' fhsjjlay 
devic^e. 

From a high-Iev<*l jx^int of view, the t>rocess of re n deling ctti 
{)bject vm\ he divided itito three st**ps: 

1. "f^^aversitig the display list that iU*scrib<\s the c)bjc*ci 

2. Cliijplrig, seal in J?, atid rotating the object with the current 
\iewpoini 



rl Thu^sfo^Tt^ng the object from primitive elements, such as 
polygons* into [axels on the screen. 

This proc-ess can bt* partitioned in different ways. With 
today s powerfiil CPl's, the most eost-e^ective method is to 
store the clispiay list in the cotnptiter sysieni s main memory. 
Tlte host (TT perforin.s the dLsplay list traversed and the 
elijjping. sealing, atul rotation sti»|>s, and dien t'^sses primi- 
tives to dedicated graphics hardwaie for conversion into 
oiLscTetni pixels. 

Graphics Requtrements. Sinenil different models, including 
si>e<iah?.ed t PI uisinictions and DMA engines, have been 
used to extract data to l>e rendered frcJiit main niemor>'- 
Wtiile these approaches wcnk, they incur the undesirable 
cQjst orspet^udis«.^d drivtvr software that doi^n't jtort well 
tielweeti [jroeessor getiemtions. Starting w-iih the PA 
TIOOLC. tliephilos(j[>liy iias been to sui>iMin the graphics 
re(4uireiuents within the exisiing mx'hitecture iis mu4li as 
po?^sihle. For exam t)ie. the PA-RISC iurchitecture defines a 
set of ■i2-!)it integer unit general registei"s and another set of 
64-bit floating-point unit general register, l^oads and stores 
from either set can be made to memory space, but only inte- 
ger register loatis and stores were archilectuniily defined to 
UO space. 

Stai-tlng with the PA TlbtJLC. fioating-poiin legister loads 
and stores to I/O Sj>ace have heen implemented. This has 
yielded imjiroved p(*rfomwmce becrause a suigle load or 
ston* vim now move tVl bits and because metre* registers are 
available for t>]KTations that conuuuuicati^ with 1/0 space. 

In contrast with specialized operations, extensions within 
the aichitt*ctiin* are generally applicable and can> forward 
into futiue genc*rations. Tlvese ojJthniziitions can alsf) be 
used t(j benefit workloads other than grajihic^s, 

Graphics Optimizations. SmririJ of tlu^ optimizations made in 

the PA T'itMJLr to li 111 her iuifjrove graphic ^s performance 

iticlnde: 

A liuge I/O store Iniffei 

A rel:i\ation of the load mul store ordering mles 

The eliniinalion of a CPl ^ hanj^ cycle t»reviorisly m^t*ded 

for I/O stores 

Improvejnents to the GSC I/O bus |>rotocf>l. 

The struct lue of itK lust ry-sian diird graphics libraries leads 
to hiu>!ty graphics I/O traffic. The hursts :ue of many difTer- 
ent. sixes, btn the most ctJimoon burst is a write of 2(j words, 
llie PA 7;]U()Lt* t*Pr core4c>l/( > interfact* im|)lements a lai^ge 
write bi lifer and citii accept nj* to H* doubk>-word writes 
wlriuHit sfalitug tlu* rp[ ■ pi]>eline. Tins allows the C'Pt " core 
to burst up to M) double-word writes to the 1/0 subsystem, 
atid then continue with its next t^Lsk while the I/O interfiiee 
is sending tiiis data cnil to the i^raphics hardw^ue, 

Graphics Ordering. P\-KJS<' is a strongly otnti(*rect jmhitecture. 
Strongly onk^red tutsans liiat all cleineuts of tlte systetti tiutst 
have a cojisistt^nt vU-^w ot system oj^eralir^ns, hi the case of 
grnphics perfonufmce, this metuis that all buffered 1/0 shaes 
must I>e obser\<nl by the grat^hics fk^\1et* bc^tore the ( PH cim 
ac<ess a suttsetnient |iie< e (if data in liiaiii nietiioiy Hence, 
an I/O store and a folhiwuig nuuuory read are serializecL 
A loophole to the ordering reiiuinnutmt was trreated for 
grajahics. I/O stores withiti a proj*rammal>le adthess range 
are alir>wed to be (itil-of-order with rcs|H^<t to tlic^ memory 
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arces!5es. Tho graphics software takes responsibiliry for 
f>rflering wht^n necessai^; 

Hang Cycle, Pixnaoiia PA-RISC' jjrcjt essors always infiinptl a 
iTiiiiiEiium of one liaiig cycle for ari I/O store. I^xtra logic was 
mkU?f\ to the data cache controller on tlic (VV core to elimi- 
nate this han^ rycle, 

Graphics Transfer Size. Ill's liigh-speed GS(' l>us is used lo 
couneci j^-aphics adapters f o the PA 7-i()0LC. The CPL^ sends 
daia to the graphics device wiUi I/O stores. In (he PA-RISC 
arclilieciure. 1/0 stores are tU Ijjts or less. The GSt' is a 
32-ljil multiplexed address and daiM bus. Srures of M hits 
turn jutfjmi aci dress cycle foUnwed by iwo data cycles. At 
best the payload caji be only two tJiird.s of the maxiniurn bus 
baufiwidd). As tuentionecl abo^■e, the average (ransff^r size to 
graphics is 26 wortls. Since these transfei's are se<iu<^n(ial. 
sending an address witli every two words is uiui^^'cessar^. 
Some fonu ul addi ess sujjpressi(]n or ciiist eriug of seguentiiid 
writes was desired. Thus, the write-variable transaction wus 
ciealed. 

Write- Variable Transactions, A uew^ wTite-variable transnelioii 
ty|>e was crealed tor ihe (jSC bus, Write-vmiable rnuis;u tious 
consist of an address and htun one to eight data cycles. Since 
the PA 7:]0t)U" mast be toiupatible wiiii existing cairls tliat do 
not implement the wnte-variable cycle type, the PA 7-X)0Lr 
only generates them in coti fig arable adtlress spaces. 

With this protot^ol, the 1/0 controller blindly issues wrile- 
variable traiisarlious tor enabled 1/0 address regions. Suiit- 
ini^ whh the initial wriie, ^i-s c^ach write is retired lium die 
1/0 writc> queue, the I/O controller performs a scquentiidi ty 
check on the next transaction iti the queiic. The process 
repeats for up to eigld G^C data cycles. Maximum perh^r- 
mance is achieved l>y allowing die I/O coiUroller to begin 
issuing the write w-ben the llret piece of data becomes 
availal>le. 

The length of the transact ion is linnled lo eight data cycles. 
Choosing eight data cycles is a good compromise between 
flow control issues anc! amortizing address cycle overhead 
w i i h p av load. Tl i e W' ri t e- variab le en h an t e i ue n f i n e reased 
maxinunn C'PlMo-grajihics bandwidlh from two (liirds of 
tJic GSC: raw bandwidlh to 8/9 of the raw bandwidth. Tlie 
PA 7300LC can easily saturate^ the GSC has at 142 Mbytes 
per second compared with the: '>0 Mbytes t>cr secimd 
achit^vc^d by the PA 71(K)bC whh caieful eoding, 

MIOC Summarv- Vhe MlOiS implemented a number of features 
thai improve systeju perfonnance wliile keeping costs low, 
hicluduig; 



• Tlie second-level cache and main memory controllers ai'e 
optimized to reduce the latency offopyiu requ(?sts from the 
CPl^orc. 

• Tin* U() contrcjller im])roves graphic s bujxlwiflth and sup- 
ports effK-ient tXMA accesses through the use of buffers and 
prefetc:hing. 

• The MIOC Is designed to be flexil>le, suijportiiig a r^mge of 
siHond-lewi cache sizes, a varieiy cif indusir^-sUmdard 
mejnoiy components, two different memory w idths, aiid aji 
optional eiTor coiTcct ioji scheme. 

Gonclusian 

The PA 7300LC design builds on f he success of past proces- 
sor desigiLs ajid cjffers .signi fu ant impimc^ments in key meas. 
Il ft^aturesa superscalar CIH' c*ore. a large, enicient on-ciiip 
cache organization, tightly c^oupled second-level cac:he and 
main memniy controllers, arul bandwidth improvements for 
graphics. These features comliincc I with hec|uency increasc^s, 
exteasive conjlgtjrability, ajid high chip C|uality maki^ the* 
PA 7300LC attractive for a wide range of computer systems. 

Af'knowl edgiii en ts 

A largc^ numlH'r of people were respc>nsible lot the succ*ess- 
ful dc^sign c>f the PA 710nLC\ c^spec ifdly the design teiims fnnn 
die Enginc^ering Syslems Lkib m Fort Ctjllins and from the 
huegrated Circiuts Business Division's Fort Collins Design 
( >\]i er. M any i n i p ort ai it co i n ri b ut i a ns w ere a I so i nac lei ly 
indi\ iduais i'nmi I he Fc)ii C(jliins SysU*ms Lab. the Syslt^nis 
PerfonUcHJice Lab in Cupertiin:>, the Computer Techitolog;y 
Lali in Cupertino, artd ol her organizations within HP 
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High-Performance Processor Design 
Guided by System Costs 

To minimize time to market and keep costs low, the PA 73Q0LC design was 
leveraged from a previous CPU, the chip area was reduced, cache RAM 
arrays with redundancy were added, and high-speed, high-coverage scan 
testing was added to reduce manufacturing costs. 

by David C> Kubicek, Thomas* J, Stillivan, Amitabh Mehra, aiid Jolm G, McBride 



While dcsti^ningthe PA T-^OULC prrH;i-8s<>n the i'PV Wtun had 
to nuike rleHij^jn rra(itM>rfo t>etw(H'n ihiie to markH. perfor- 
luant'i', and niannfacruring cosrs, ( k-rasif >nal]y tlu\st^ fic^f^ni- 
iugly rani raciii ioiy goals workeci Ujgether to dilv*^ the leaiii 
to a dec-ision. More often, however, the team had to make 
liard ck^cisions, weighing tiie benefits of each of the design 
goals. 

Tills paper ciiseusses the strategies iised by the PA 7*jiOOLC 
physicjil design team to inipk^nenl the design goaJ.s for the 
PA7:J00L(\ 

Design Goals 

( h^e of the factoi's driving the design process wits die desire 
to bring the product to market as tast iis possii>le. To accom- 
plish this goal, we ennjioyed direc nu^ior strategies: 

• Leverage a.s mnch aw pussihie tram I he i>re\ if jus HP proces- 
soi>;. inc!n<ling hardware* software. lUid methotbkigies for 
design and les! 

• Design qnaUty into phase one, or the presilicoii design 
stage, so that there won Id he fewer iterations nf iho design 
( i n ri ng t>hase t w f j , ;i ft v i i I le h i^\ {n\iv nA oasi^ 

• Monitor projt^et projiress. avoiding any nbstacles tiiat might 
seriously impact or ttireaien our scttedule. 

Keeping the cost of the system as low as possible was an- 
other impoitaiit goal of the project* Systems Ijased on the 
PA 7:j0fJLC are meant to r^nsition HP in the low-to-midrange 
w^orkstation market where [rrict^s art* set Ijy rompi^niifiii, oui 
system (*ost. TI\iMehire. s;ivinjt*s in I fie syslem cf)st have a f>ig 
imiiact on profit. To meet these air as, the team (kn'ided to: 

• Imegraie tlie fiiTit-level cache, a nuyor system crostl, itiio the 
prot^essfjr; wliicii had never heen done before in an HP 
n\icroprocessor 

• hitegrate the nieoiory and t/O eoiitiotlcr (MIOt ), creating a 
system on a chip 

• Reduce^ chip art^a t o lower cost 

• Add retiiindancy to the SRAM anays on Ihechiii. allowing 
Sitinv prtxess defects to tie rer^airet), diereby saving ctups 
that would otheruise he thrown out 

• Provkle high -i overage. higJt-speed scan tcstii^j^ ro lower the 
manufacunin^ cost of fh<» professor 

Designs Leveraged to Minimiice Jiiut* to Market: 

To reduce the time to market for the PA 7:MnA\ the CPV 

physical design tcNui dtM-ided to leverage as many dreitits 



as possif^k^ fifjm the PA 71dOL(\ Exce|>t for the process 
shrink from l\MC)S2(i to ('MOS14, much of the siiperscalar 
integiTdata path on die PA 7-ltK)Lf was leveraged houi the 
PA TICHJI^C uruluinged. Also, n^ativ of the cells used in the 
Integer data path were used in other data path blocks on 
the rlui>- Aldiough some of the circuits were reworked for 
speed imjirovements. the noaiing-point unit was ajso highly 
k^veragcd trom the PA71tK^L('. FuillunTUurc, the tloating- 
ptnm unit was usetl hi the geometr>^ accelerator chip for tlie 
Visuali/.e 48XL gi-apliics product . This leverage slrate^ not 
only hel|)etl reduce tunc to market, hut als*> split the design 
costs associated ^dth the circuil hetweeii the ASIC and the 
VPW 

Control Blocks 

Wliilt* :rll of the controt lihH-ks leveraged (ronr tlic FA 71t)UU' 
n gutted some chajiges. nnich cjf the original control logic 
remained intact or was at ieasi srmilai' to the original ccnle. 
This rncivided the opportunity to start the physical design 
e;nlv, pnnklin^ the desijjtiu is with the i hanre to work the 
Ipu^s oHt ottlu^ tool tlow\ begin compositinn. and provide 
early feedback on dilfieuh timing paths lo tlio coirtrol 
ck^signers. 

Forphysu-al circuit layout, theiontrxd physical team iny 
tially used data scak>d from the PA TKHlI.C in l)ie ( Mf )S2(j 
process to die CMOS 1 4 process. In several easi»s. tlu- llnal 
arl work was atnnjst erUirely based uptju the n(K>r plan 
scaled from the PA 7HK)L(\ In other cases, the control equa- 
tions were either vastly differerU (niernory I/D (*ontrol) or 
entirely new (the cache controllers), so we were irnatde to 
lake advantage of earli(T work. 

In the case of the three main integer cf)ntrol Itlocks. the 
timing infonrmtion and a signincant porlion of the control 
equatkais were usable. However, a study r*f interconnect 
between ihe three bkx^ks jjifiicated that they c(Hrld be t (xm- 
hined into a singk* block U) simfjht> the rlesigu frcjui a timing 
standjMMnt and to use ginbal naiting n^smui eselTicienlly. 
ISy moving sever a! hnrulred sigrrals away from the center of 
the die into a more k)eah/.erl area near the intej^er data path^ 
we also sa\ (*d sigriitu mtl aiCtu 

Core Logic Library. While mu*ii tit the h*gical design of the PA 
7:i!M)lJ' was k^veraged from the PA 7l0t)b(\ nrosr of the stan- 
dard cell libraries w(*re borrowed from the PA 8t)U0proj*'ci, 
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FiiJ. I. Thr- [jlif>T(initrrngni|ih 

shows a l:.v^li^al FIB [Tncuw^d inn 
bpaiTi) rpptiir. Fnr this FIB repair. 

|)nr1ioius of t lie circHiit. of Mernsl 
wf^iT iociitefl iiriflcT a iiiPlal four 
jKjwprlujS- Tfiprt'frjro, fJi>eniiij*s 
IkiiI ki bp nit ftiroiigti Hip ptmtM" 
biis U) m'v.i.rss Ihp cirf'uils hcMow. 
Nrj|irr> haw ilip fijatmiun depns- 
ilpfi by llip [If^ runs ovf^r lop 
nf iltp nwUi\ ffMin sprif'^^'-'i'^'J ^>y 
pjijssivaliuiL 



Tlip PA 8000 was labrifalecl using llie fiajiie IC inort^ss Irciv 
iinlog>' a^ Uip F^\ 730ULt\ but was fanhfn ^iloiig \u \\\v fiesign 
ryr lo. T]w PA 73(M)LC' learn w£is able* ro ti?^ almost I be entire 
PA 8000 cove logic Ubmry iinc* hanged. [Tnforlunately, a tlir- 
fcreiit rloekingsitate^ mean I thai tlie driver libraiy needed 
significant rework. 

Standard C&lbBased Control Block Design. The use of a nlan- 
dard rell-bast^d di^sign fm \hv conlroJ bint ks. whit h wais 
leveraged from ihe PA 7100L(; billowed great nexibilily 
wben Jlxing nrnelional bii^s, both in pbase one (jiresjlit^on) 
and in phase two (|Kis!sjliron). During pb^ise one, the stan- 
dard cell ap]3roach permit teti fairiy quick tiiniarf>iinds of a 
control block for rather complex t hanges. Often all that was 
ref]nired of the pbysica! designer Wris to rerun the synthesis 
anti rr Jilting fools, apply a few band changes* and verify the 
design. 

Use of Spare Gates. During the very I ale stages of pbase one 
ami all of [ihase two, the use of spare gales in ttie standard 
cell t>lockH allowed the physical designers lo make logical 
cJicinges l>y cJianging only the metal l^iyers. One very late 
bug fix was tuac^e betw^een the time the lower-level ntasks 
(e.g.. difbrsion, wc^ll poiysilicon) mid the higher level metal 
masks were relectst^d lo tin* mask shop. Additionally, when 
phase two Inigs \^ ere foinul, we were able to nse the sfiare 
gates for metal -only changes. Because a number of w^afei^ 
were held in the fabrir atirm .shop before Ml fthe lowest 
le\'(^] f)f metal) was placed, meial-only changes were nui 
through tlie Fabrication sho|) ver> quickly since the lower 
layers vv er e aheady processed. 

FIB Process. j\notlier advantage of the metal-only chiuiges 
was exploited during phase two. As control bugs wTre un- 
covered, we w err able to rewiie the lf>gf(* using st>are gates 
and the Flli (focused ion beam) t'^'f*^'*'^'^ Hie FIB process 
uses an ion beam to cui and expose various metal hues on 
a funclioual chip and to deposit platimim* reconnc^cting the 
gares into o new- logic stnicture. A topical FIB repair is illus- 
trated ir\ Fig. I . I'se of the FIB process allowetl the design 



team in verify bug fixes in a system that often ran at bill 
o])erating spt^ecL Tbi.s resulted in a more complete tunciional 
\Tjification, since tests run much faster in real silicon t han 
in simulation. 

New Tools, \\lhlethe synthesis tool (vSynopsys) and routing 
tool (C'ell-3 from Cadence Systems) were the same its on the 
PA 7100LC project, newer versions of these fools witli addi- 
tional features imd problems were employed, 'flu^ ability \o 
work with the tools at an early stage allowed ihe i)hysical 
contrf)l tiesign team the chance to leain the strengths anfl 
weaknesses <.>f the ti >ols, so that they could be ex^i lotted or 
compensated I'nronce hili fimiti finality was reached in the 
control equations. Hven tlKJUgb new versions of these tools 
presented a few new problems^ the basic method of opera- 
tion w^as the same as for the PA 710()L('. Thus, use of these 
ttKils heli>ed ret luce our time to maiket by leveraging our 
l>re\ ious experience with tliem. . 

Phase One Quality Equals Reduced Phase Two Debug 
Time 

In addition to leveraging designs and methodologies, correct 
balance betw^een the time and resources spent ensuring 
})hase one f|uality and the time and resources spent finding 
functional problems in ]>hase tw^o can also reduce time to 
market. In this tnoject, we gave great w^ei^ht to ensuring 
phase one quality, since this would make debugging much 
easier in phase two. In return for our investment, the 
PA 7300LC had one of the shortest and smt>ofbest phase 
two periods of all CPUs designed by HP 

Debugging TVade-ofTs 

In pbase one of the design cycle, sinuilation, emulation, and 
hand mialysis are the key tools :»f the designer. With these 
tuols, the designer can examine eveiy detail of Ihe design ai 
any chip state and tmder any conditions. 

In fihase 1 wcL tests can be niii much faster on a real chip 
than in simulation, accelerating bug detection. However, 
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nxji-rause analysis of problmis is a sltiw aiicl difficult pro- 
cess bee aiise virtually all signals are hkiden frum tlie scni- 
tiny of ihe designer, in ajcidilion, the signals iliat are avail- 
able for the designer to examine are either chip L<>s, or are 
onh^ Lndireelly aviailaljle rhrough scan jiatiis. Hence, ekn^irical 
plienomena such as gliu hes and power supply drcx*ps are 
not eiLHH>' obsenedTherefore, jihase oue debugging i^ a 
niiicli sinijiler p^oee^«s beeanst* of rhe availability uf deiaiied 
dat^ abuni the intenials of the cliip. 

Cr OSS-Checking Designs to Improve Phase One Qu^tj' 

To ensure i>Iuisc oni- iiualiiy. each design on the PA 7:JO0LC 
wius subjected to a series of conipuierized automatic design 
checks, and then a set of manual checks was performed by 
designers who were nol involveii witii the origmal design. 
These cliet'ks kit^kt^i at: 

• (Ircuil lopokigies 

• FET size and conne<'tionw 
■ Wiresiste 

• Power routing 

• Clock sign^il routing 

• Signal couiJiing 

• Signal ty]>es anci timing. 

Automated and Human Checks 

For aiitouuited (esis, ihe eomp liter applied Ltie siuiie tides to 
each tiode on the cMp quickly, witliout tJie bias tliai a human 
may have hutl However, the computerized checks often 
generated H(>iirious error niessages ajid required significar^i 
human intervetition to identify the real probJeih?^. Also, iiuuiv 
rules were beyonti ti\c scope ofcoiniHiier algorithms tuui 
rcquireti hunuQi checks. 

An exainpk^ of a siinpie computerized checl< used <jm itie 
I'A 7;30OL(' is tisi^nid etige rate check. Evei'y si^i^nal on Ihe 
chip was c'iu^rked against a .set of criteria tlui! tlepended on 
the signal ronlext. Tlu* f-onriniler program iiliiully rej^orted 
any signal that vi(K|ated die speeitu-ation set for lliat type of 
.signal. It was the joli of the designers lo detennine whkh 
errors fjaggerl by the eonipnti'rized eherk were real prob- 
lems, 'f he designers I lieu fixi'd n'ai errors jmd waived all 
others. Obviously, with tliis and all other aiitornatic checks, 
a certain amount of skill and experience is needed tojud^e 
wluit consthutcsa potential prot)iem and what do(*s not. 

Because some quaUty checks dc^ not lend ihemficlves easily 
t(j romputen/ed cheekjj\g. each ceil, subblock. ;ind rrLijor 
block of the (Ft ' hati To f>e examined by an experitMued 
engineer w1k> was not the designer of tiie Ijlock. The eross- 
(*hecking engineer liad a list of |*uidelines to follow for 
checking each desigiu and any variaiu;e frotti these gnide- 
hnes WHS discu.ssed with the (k^signt^r This eberklist was 
broken into eategorie.s so that tiie cross-t hetkhigiHigineer 
could focus on one particular area at a I ime, sucli as sche- 
matics, artwork, test, and so on. 

An example of a check (hat is not auKmiated is an artwork 
cheek, which ensures that all circuits have vco' solid pow**r 
and grottnd networks. The sirbji^tive nature oriliis eheek 
makes it very tlitOcuh xo implement with a computer eheek. 
Also, because of the subjective nature, the checking enj^lneer 
must be viHj dlligetil about what constibit(*sa solid sojiply 
net and wbai does noC 



Circuit Timing 

Ulien operating wiih a ck>ck period of only a few nanosec- 
onds, tinung is of utmost importance as a phase one (luality 
issue. Several fiifferem looks were usetl to ttiis end, tu<*st 
notably Cadences Veriiime, EPICs Paihmill. and HP SPICE 
(see Fig, 2 K 

Chip Model Tools. A Veritime model was generatetl and main- 
tained (or the top level t>f the chip. This model inchsded 
either gate-level de^cript jons of blocks (generally for the 
standanl eel! blot^ks) or black box descrii>iions of blocks 
(for the ciLstom data path blocks), as well as models for the 
delay due to the imerccimiecls between bkK:ks. On a regular 
basis, the timing teaii> update*! the model imd t>erfonued 
timing lumlysis. The residis were then given to the various 
block owners, who redesigned slow portions of critical 
timing pat Irs. 

HP SPR'E antl EiPiC's Pathuiill were ustnl by a muuljer of 
the custom data path designers to generate black tiox mod- 
els of their t^loeks for the global Veriiime model. Also, scmie 
desigircrs analyi^ed laiger standard cell bltM^ks with Veritiiiie, 
Additkmally» a tool was developed that estimatetl the <ielays 
of all signal routes, whkh coiikl then b^ hand-checked for 
aiioniaties. 

Finally. IIP SPICE was used extensively lo simulate the 
timing of all nu\jor liuses. many top-level routes, anrl other 
timing-critical paths. All elements of the standard ceil 
libraries were also chai'acterized witli HP SPICE, using 
ccjnsenative [larameters. While this apf»r<»acb canst^da lew 
more phase one beadat hes for the ctjiurol designers, we 
uuctjveted no liming issues for statvdard cell blocks during 
I jhase t w o c li ^uac t er izal ion. 

Chip rnmposjtion Focused on Minimizing Cost 

One oT \ Ur ways ibai we were al>le to (iri\ e d<iwh the cost at' 
systems that iarorporate the PA THtHlLC was to rc<iuce the 
die size, t liereby allowing more die per water ui falnication 
ajul imprf>ving yield. This was a key fotnts t)f the physical 
design group, and resourees wiT(^ dedicated to monhorin^ 
the iinjm<1 tjt all changes on the manutacturing ciisi olMie 
ehij). We took several steps dining [jlvase one to ensure that 
the PA 7;S(H)LC was as small as we could n^asonably expect 

Global Floor Planning. We starled global lloor plannhig and 
ronling early in the tiesign phase. Our initial lloor piaus. 
althnugh they liear liltle res(Miil)lance Uv tlie fiiud ebip floor 
plan, jjrovided tire groundwork lortarly estimates on die 
size and feasit)ility; One of the early decisions was whether 
we would use three layers of luetJtl, as oti the PA 71(Jt)LCV or 
add a founh metal layer. Mier extetisive analysis, we con- 
elmJi-i) that, with only three rm-tul layei's, our stepticr size 
vvouki linut us to a very small 11 rsi -level caehe, whit b would 
not meet our performanc*e targets. So, we added the fourtli 
metal layer- As it turned out. the fourth metal layer was 
essential to the success of the project for ruany other 
reasons, even lliough the decision was made over a year 
biTfire tape releiLse. 

As the design matured, the flr>or [dan and routes ke|>t up 
with the changes and provided feedback on die size mul 
potential timing r'J'(*'>J''nis. Pig. :] shows the final lloor t^hi^"^ 
We made several m^jia' roinpositiixnal t hanges early stilely 
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to renu)ve cojige^slioii on iiu' Lop niolal layer aiul lo compaci 
dve die area. These compositional changes included move- 
mt*n1 of a hlork in the data caclu^ and chart^in^ the tu^pef I 
ratio ot our pail-tiiig iiiLsUte. 
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- Space Registers 

- Trait^tation Loflkaside Buffer 



Using the "Dirty Trick." One of llie oi:>poi1iiiiiiies wt^ saw was 
m the composition of the data cache. Otiginally the data 
cache was designed to he^ cran[>letely j^yiiHnetrienh with a 
righi mk\ a kdi side, and a diiXa palh in ihe middle lo merge 
the dala IVom the two side*;*, Tlie design essentially had three 
l>l<uks tjii each side stacked from IxjtKnn to it>p: the data 
RAM array, the tag RAM army, and the tUity bk)ck array. 

As we started routing the chip in its e^'ly phases, we saw 
that we had much more routhig congestion in the c hainieLs 
above the right side of Ihe data t ache than aliove tlu' left 
side. The cliannels afiove the right side led to the integer and 
floating-point units, while tJiose above the left raji to^vaids 
the niemor>^ and I/O controller (MIOC) (see Fig. 4a), The 
congestion on the nghl side of Ihe data cache would have 
iiictea.se(J ihe height, while leaving unused area above ttie 
lefl side. 

To deal with I his congestion ]>roblem. wo employed vi ha! we 
called tlie "iluly trick." The diily bit block in the data cache 
is used lo stcire one bit of infonnation for each line in the 
cache. Tins lill fells llu^ jmjcessor wlieiher ihe infonnation 
contained in I ha! eache line has been niodiHed hy I he CPl' 
and is therefore diny. In onr otiginal coneepliont each si(ie 
of tlie cache had its own iiirty l)lock, w hic}\ {-ot^sisted of one 
bit of infonnation per cache line and an address-to-cache- 
line decoder, the latter being ten times larger. 

By putting !>oth dirty data hits on the left side of the data 
cat 'he and shiuhig oj^e address decotier. the left side of the 
data cache grew hy one hit of infonnation, lint the right sifle 
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shrank by one bit of kifomiation and one address decoder 
(see Fig. -lb). This wajs a l>ig win iLs il /jJlowetl the flie size to 
be slinmk by th<' hei^lil ol' iht* ditty block. If we had tiot 
floor plan J led aitd routed early in die design phase, we 
would not have seen this opportunity in time to act on il 
antl reduce ihe ciie size. 

Outer Ring of Pads Limited. We were not in the clear yet, how- 
ever. Altt*i all of this work on driving down the die size, we 
entered ai\ interesting situation. Even though we were able 
to stinnk the core thjiiensioiis, we were now limited by tbi^ 
outer ring of pads that connect the die to its package. This 
was not at I issue earlier, since we had fewer pads and a 
larger (*ore, but as the project progressed we added pads 
and shrank the core until we reached Ibis predicament. 
However the I/O ring leant elongated tjur bit slice iti the ring 
slightly and narrowed the width considerably, allowitig us to 
reduce our die size until we were once again limited by the 
size of the core. Again, this Irade-off on the* t>it siit e chmen- 
sions was not readily appiiretu at the outset of the project, 
but was nlmously a Itig witi when wt* tuiatyzed the situat.iotr. 



Mela I- Four Ro tiling. Tlie last obstacle came after we finished 
a 111 ( ) maf I ' d signed ro 1 1 1 i ng , Tb e n >ii te r we usee I , I L\ R f V w^as 
designed for the Unee- metal -layer process used on the 
PA 710DLC atid so it was unable to automate Oie fouttlt metal 
layer It was a channeMjased router, which Edlowed the block 
designers to use all three metal layers withiti their block 
bomularies, hut reqtiinHl us to leave* areas free l>etwei'n tlu* 
top-level blot^ks for the interconnect. We usetl MARF^ to con- 
nect signals belweeti top-level blocks^ bi;t we left nu\jor 
buses, power connections, clocks, and speed-critical signals 
for the hand-routed fourth metal layer. TIris nteanu however, 
that any layer-foiu metal used witliin the blocks could irUer- 
fere with the global metal four, which we plaimed to run 
over the blocks on tbte chip, not merely in tlie channels. 
Therefore, from the outset of the project, ntetjil ftmr was 
101 der the ownership and control of the composition team. 

■ HARf' iHewlett'Packarri Automatic Routing Pfogfaml is an inlemaf rouiing tool thai viias 

leveraged from the PA 7 IODIC toolset 
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The cache an'ay desi^ers were given full control of layer- 
four met til in then" areas, but all other block designers pro- 
ceeded as if they only had three rnelii) layers. As die gloljal 
metaj-foiir door plau uiatiired. rud^lal loui- was relca^jetl to the 
block owners to reduce aiea in places that did not c^tjnllicl 
witli the global route, hi ^dl other ceases, i\w block owners 
were eojisi rained 1o use tht^ lower thret^ metal layers Lnslt^ad 
of jjlaein^ obslnjctions to ( he globed metiil-fouj' route, eviii 
if it meant growing Uieu: blocks. 

This stingy allocation of metal lour betrcune very iinj>or1anf 
as new buses ;ind liniing-critit al signals were promoted ui) 
to the metal-four "superhighway." Ne;ir tlie end of die |jn>i- 
e(M. the t^onnection of tbt^ met^il-four f>ower liuses to tht' 
top-ievel blocks became more and more challenging, and 
would have been unpossible if not for tlie freedom retained 
by keeping metal fom^ cleai' of obstacles. 

Leading the Oexibility to make last niimite changers was criti- 
cal to meeting our die sixe conunioncru. Since, at thai point 
In ibe project, our packages bad biM'u ordeied with a st>eri- 
tm\ die cavity, clumging woukl have had serious ilnaiicial 
aiid schedule imphcations. 

Practice Runs 

The PA 73(KJLr team used sevenil tec bni<iues lo ensure that 
tlie project wcuild proteeti as smoothly as po.ssible. These 
tecliniques included building an SRMI test chip aii(i doing a 
mock tapc^ release. 

[Ising a Prototype Chip. Before the PA 7mnx\ HV iiad never 
pr<)du( t d a CP[' witii any sigjiilKant auiniml <if orecliijj 
nu^ mory. 1 1 o w c o ukl w e e n s u i e 1 1 lat t ] le cm *t i e wo u 1 < i w o r k 
in first silicun? Widu>ut a working carhe, iimning test code 
in an actual system woukl not be practical. To hel|.i ensure 
a w^orking cache in first silicon, we designed and built an 
experimentid memoi'>' chip, featuring various RAM tell 



topologies. This test chip pro\ided a large amoimt oi\isibil- 
ity into the workings of llu* RAM cells. It also proved li> be 
an excellent !<iOl fur analyzing the workings of ihc on-chip 
c af;h(\ Because the i^AM dt^sign was eireelivt^ly ■"ijliase two 
vcrilunr during pb^ise one of tiu' CPU design cycle, the 
PA 7t]00LC tm-chip cache worked in first silicon, greatly 
casing the tinu* and resiairces retjuirerj fur phase iwtj debug- 
ging nf the rest of ttieCPU. 

Mock Tape Release. Tape release of a C'PII is quite a cumpli- 
cated r>rocess. invrjlvijtg several straps of database coi^ying, 
vf^rifiealiont translaticju, and data txansniission. Also, it does 
not lend itself to leveraging. Any one of these steps ccjuld 
cause a delay of severed <lays in fabrii ating tht' riiip. Tlune- 
fore. we performed a mock tape release m which lach step 
was executed as if it were part of an actual tape releiise. The 
only exception was that the data used was not the final, fully 
designed CPL', When tlie time came to do the actual tafje 
release, the f Hoceys went veiy tuiickly iual smootJdy. 

lSRAM Redundancy Improves Yield 

Witli abuiM eight niilliun of I be nine milliiai transistors on 
die P.A 730nLc:\ the i arlu- is tlu^ most likely block on the 
chip tei fall victim to a fabrkatit^n defect. Therefore, we 
addi»d redmidant liiocks of four columns each in the SfiAM, 
so that a block that contains a fabrication defeel can be re- 
p bleed with a funetional blui k via a set of rnuhiplexers. 
Fig* 5 iilustrali^s ibesbiftetMttork metliud used in rt'phu'e 
the defective block with a redundant block. 

The select logic on the niu hip lexers shown in Fig, 5 is con- 
trolled by a fuse that can be blown with a laser to deselect 
the failing blork of c ohunns and select an aiijacent lilock. 
The ac^acent block's mulli])lexer nmst nlso be pjiigranuued 
to select thc^ next block, TJiis ripide continues until nne at' 
the HHlundani bkM ks has been sobslitnted into the liAM 
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Fig* 6. 1\vo of the sprial Riinibpr fuses in the photon^LTOgruph uln.iM 
I lent' lit^en hlmvit by I lie kisen The ullipr ts^tuin^ tt^fl iittacl. 



iiiTMv. By siil>stituriiig mi atljarent blotk rather lliaii imnietli- 
jil^'ly SI I bsti tilling the ivdimdaiit hiock \ov the dcretUvc^ 
Ijlork, timing rhanges an* mmimizcd, 

Adding a Serial Nuaiber 

ihw nf ihc new iVatiires iruoiporiirpd on tlu' PA T^RIOLC is a 
serial miinber jjulividually progianuiieti unlo eacii die l:>y tlic 
same laser thai progi'ams die redundancy selection niulf i- 
plexer^s for tlie on-( hij) ( aclie vSRAM. As wafers arp put on 
(he laser for eac lu' redundaniy prngraninuiig, we are aljlt^ to 
t>]ow like serial mmiher fuses al the same lime. Tlie serial 
niiniher feature was added to the pnnluetion flow wilh veiy 
little overltead. Fig. ti shows a set of serial number fuses. 
The s(Tial number was aflded to the l^A lAOiiLC lK*eaiise; 

• It provides (he aljiHty to track miy given dip back lu ils origi- 
nal loti wafer, or die designadan, so w^e can aIlal>^Ke inh)nna- 
tion gathereil on the die at wafer test and at, in it ial packagi* 
test. On previt:jus niicroprt)cessorH, w^e w^ert^ unable to track 
backwards in this f^ishion. 

• It allows the design leiuii to select specific dice oft a wafer 
without ha^ang to remove the whole w^afer from the produc- 
tion ilow. Tltis makes it mucli easier to grab inl (^resting parts 
for further chaiacterizatioii. 

• It i>r<}\1tles a convenient way to jclV^r to and classify produe' 
lion parts. The serial namticrs became an uivahiable pan ot 
the phase two debug effort., bec^ause we were al)le to know 

1 1 le \ \ i st f >iy < » f I h e p 111 1 w e w t ^re 1 1 el i u ggi ng. 

1 1 ig h - S p e ed , H ig li -( ' c >ve rage Te si i ng Red u<ed 
Manula*! uring ( ost 

Bcidi bicjiidsidi* piUiillel antl serial sc£m tests were used to lest 
the I'A 7ltHJL(\ Many of these lests w(*n* lev(*ra|ied for use 
with the PA 7300U'. Some tests were siin[?!y copiett from 



the PA TlfXHX test suite aiid reformatted for use with the 
PA 7300LC. These tests includeti legal PA RISt* assembly 
cocie tor parallel \e<'tars and serial scan rests of highly le^er- 
ageii blocks, such as the integer data paUi, 

Other tests required small ch*inges. For instance. TLB lesLs 
on the PA TIWIX' invol^eil writing and tlien reading a x-ariety 
of values for each TLB enii>\ Then the tesl simply looped 
through ttiis process for each of the i>\ entries in the TLB. 
Thus, to test tire PA 7:3dl)Lt' s OB^ntry TLB. we merely 
(hanged the loop value frcim 64 to 96 entries and refor- 
lualte*! the lesi 

Automated Test Generation. Ulrile niaiiy of the highly le\'eraged 
nisronr data path bloL^ks t*ouUl use scan tests levei-aged froni 
dre PA THWIX'. this wtLs rrot the v^se for iJie logic-svntliesized 
stairdard ceU blocks l>ecause £iiry logic change rendered the 
old tests useless. Fortmiately, the iLse of air auronrated test 
generation tool allowed the PA TrjlXJLt " rearrr to irave a signifi- 
cant portion of the serial tests written before we received 
fir^t silicon. Shoilly thereafter, we completed Ute res! of the 
serial tests, with high fault co\'enige. The <uiitr(ii t>lock test 
efforts were a! so helped liy widespread ust* of state memory 
latches which were conti'oIlabJe aird observable via serial 
scan testing. 

Manual Test Generation. For custom data patli lilocks that 
were not le\ eraged, such as those in tJre MIOC arul cache 
controller I docks, bloek designers wrote tests by haird, en- 
suring that each tniosistor iir their desigir would be tested. 
Often, this daunt iirg task was aided tlirough the use of l^erl''' 
scripts l<j trelp genei^ate the lest vectors. Tirus* many circuit 
desigiiei^i found themselves becorrring part-time software 
(k'velo|ipis until dteir I dock tests were written* 

Venfying Block Tests. As block designers began generating 
serial tests, live abiliiy to verify these tests became an issrre. 
Sinrrdatirrg a single block lest oir a model of t tie ctiii) woultt 
take anyw here IVtjm a few mhurtes 1(3 several hours. How- 
(^v(*r. a real chiri e(uild run even the largest test in just h few 
seconds. TiieretVue, a way t(J v(*rify the block tests ou an 
real chip could save a lot cjf simulation tune without ann- 
liroiuising test quality 

Hijwever. the tester's Listed U} test tluse chitjs ui niarrufact ur- 
urg were niit readily available. Furthenivore, they were too 
expensive to use Tor this putijose. Sinee we were ninnmg 
serial block lesus, we oiily needed to ccaitrol the chip's serial 
test port pins. Tire other chip I/O pins could be tied to 
groruid* 

Foriunatelyv we had <iecide<l to make our serial test j)ori 
comrdy whh the .JTAG (IEEE 1 149. 1 ) industry stai^dard. This 
meant that a relatively inexpensive test |>oi1 iirterface was 
readily availatik\ We piircbtrsed a JTA(J bidiisliies PM :]720 
and built what we called a **bench tester'' iu^otmd om^ of 
rhese interftK^e.s. F'ig. 7 shows a bloek rliagrain of the ijeirch 
tester, 

We fed I he chip power and controlled the reset puis widi a 
couple of <>ld IIP fUJOi^A d<' })(jwer su|)plies. Tire system elork 
was provi(ied by an ttP8i:MA [julst^ generator. Finally, iUl of 
tlrese comiJOireuts were controlled via the 1 IP-IB aarl con- 
nected to the lab's coni|)titer netwrak through au iirMlbOfl 

' f^crl iPracucal Extfdi'iiofi Repnti Lunyjjagef ty dsJEiigfT-ed to tjandle a vanety at UHIX"' systfini 
administialive furcltrjns 
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Mfjflel 74n iiitiiistrial worksUitioii. Thr nnwork cnnitf^clion 
alidwed rjeyigiiers lu rim teiyts ^uiti niomlor rt?siills I'lgiii 
their desks, or even Ironi home. 

Many block desSigners pushed the beiK^h testers well heyond 
their original intended use. By ihe end of the project, they 
CQiikl ln' iiKed to cnv^re vollagv-versus-rrt^qnency slunno 
plots lis ihv rests were i^x{*riited over n range nf powvr siii)i)ly 
^ind clock frequeney values. We even engineered a way to 
execute a loop of code in tlie instriic^tion <;:arhe %vlth no 
oilier sysUnii Bupi>oi1 logic, proving that the PA I'ddOLV is 
truly a ^jyslern on a chip. 



Summary* 

In conchision, tiie PA 7-:It)0Lt' design team ow^es much of its 
success to pie\4ous project teams. Our aggressive time-k)- 
market goals were met nt>t only because of circuit leverage^ 
hut also heraiLse of niefhodologic^s from previous projects. 
Alst>i ;ui early fo(^us on quality jjita enled a lot of rewcirk at 
the end (jf the project,. Excellent, perfonuance from this 
highly integrated processor gives MP a competitive advan- 
tage in the cost-sensitive, perfonnance-hungiy markel foi 
wliic^h it was desig^ned. 

UMIX IS a r&gistarad uademsrk in ihn United Stciresand other cguntnes. Ii4:ens^d e){4:JLjsi^^3v 
Ihmuflh X/Open Cunifiany Limited. 

X/Opeii \% a registfirad tiadenriarK and ihtJ X device is a trademark of K,<'Open Companv Limited 
In rhe UK and oEhar couiitfies 
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Verifying the Correctness of the 
PA 7300LC Processor 



Functional verification was divided into presilicon and postsilicon phases. 
Software models were used in the presilicon phase, and fabricated chips 
and real systems were used in the postsilicon phase. In both phases the 
goals were the same — to find design bugs and ensure that customers get 
the highest quality part possible, 

by Dimcan Weir and Paul G. Tobin 



Ensuring Uie t'orroctness oi' tht- eoiuplex FA 7*iU0LC' Ue.si^n 
rtH|iiin'tl an extensive veriiicatian eff(jrt. We wan let! to en- 
siirt^ tliiit no customer wonki ever em^fuinler a ilesign hug. 
To iTarli tliis goal, we set out lu exercise Ihe (iosigii inure 
exti^nsively tluin is drjne wilh uner sofiw<ue. Prevlcjiis ilP 
processors have luainLainetl a well-emiieci reputation for 
quality, and we wanted the PA 7:^<K)1.C to meet or exceed 
I lie quality of its predeeefs^oi^. 

Tliis papier discussef^ tlie methodolog^v used to verily The 
etirreetness of the PA T'JOULC and the dui^niosijf hiuclwai-e 
incorporated into tlie design lo siipi>ojl (leV>ngging. 

FunctioiiaJ Verification 

Tlie fntictional verify at it in efTon was dKidetl info jiresilicon 
and J HJSl silicon ]»hases. The presihcon phase invohed <reat- 
inga software model of (lie t hip and an en\ in "anient in 
wliich tlie nuKlel conUi l»e llnirou^ldy lesietl antl d(»hngged. 
The modeling envii*onnient providefJ inmiy features to aid 
verificaliun iin Indin^ the ahilily lo iialialize Ihe nvaehine 
slate, i Inject sliinuh, and see into all poitiuJis t if die design 
r<a fiebugging, One nuyor drawhac^k of Uie nujdeling en\i- 
ronmenl was the slow sin tu hit ion speed. 

Complementing the presilicon effort, an extensive post- 
silieoii verification progr^uti was con^pleted tliat took ad\;tri- 
lagiMiT ihe test throiighpnt available wlien running on an 
aclnal < *aripnter. 

Extensive testing of the physical cireuil (ienign of the 
PA 7^1()0Lt* was done in presilicon and |iostsilieon environ- 
ineuls to ensure that the circuits wouiti meet frequency, 
voltage, and temperature target,s. This loivic is covered m 
Uie article on page 61. 

Presni(*on Verification 

hor heller effic'iejuy, we eliose to divide the design of tlie PA 
TMtlUht ' info two ctiniponenis: the CPl " core and the memfa>' 
an<i l/( I cttrtt roller ( Mh )( '). These two poriiiais of die design 
were logically sepaj^tUed hy a welKltKiNnented interfaet^ that 
enal)led us la veiify each component independently. Verifying 
thi^ two t'omjxjuents indetH^nderUly [uovi<kKi stneral Ijenefils: 

• Si I laJler an d t ^ isi t ' i j n < > t ii * I s 

• Precipe control over the slinnili al Ihe I'Pl -\H< )(' inlerface 



I Simpler model nianagement (because less coordination was 

neeikHl) 
' Reduced debugging tiine {since it was known which portion 

of the tlesign coniained \\i^ bug J. 

;\s the design neared cfimplelion and both the CPl' anrl MOC 
had been extensively \'<^riried. we created a single merged 
model dial includetl both components, ThLs proviileii a tlu)r- 
(5ugh check of the inlerface between the components and 
was a double check r>f the independent verification work. 
In addition, the MKK' was incoiporated into a mo(kd with 
external l/<.) rtevices to ensurt* tliat the PA 7M10LV design 
would work with tin* components necnletl for a complete 
con ttn iter system. 

The presilicon verification envaronnient: consists of three 
paits: modeling envirnnnienl (model), test c^ase envirorunent 
tslinnilil, and r bet king environmenl (checks). 

Modeling BnvinHuneiit 

We modeled Ihe PA 7:|IKILC design using the Verihjg hard- 
ware di'script it in language*. TlitMlesigu was prttnarily modeled 
at die logic gale level wilh conuet liviiy exrracted Ironi Ihe 
physical design. Stjine key poriions of the design like the 
caches, TLBs, and noalingijoint execution units were 
nio{lel(*d at a higher h-vel to improve the size and speeil of 
Ihe model. 

Fig. 1 shows llie CPl" and MltK' modeling environu\ents. 
Softwmx' emulalrjrs were conned ted lo the mtjdt>l interfaces 
to provide injiul and respond to output from the model. Tht* 
programmable nature of the emnlaiors allowed test cases to 
exenis(^ llu' nil el facets fnlly. 

New Modeling Process. Managing the modeling environmem 
of a large design is a liineH onsuming task requiring corjr 
dination among all team memtjers. Priiblents with a ntodel 
build could lead lo downtime that would stall tlte veriftca- 
tifai cffoii\ To minimixt* these problems, a new mot lei build- 
ing ]>ro<'ess was iittplenuvuted for the PA 7:J(HilX' dt^sign. All 
bliicksnf Ihe inotleling environment wimc placed under iwj- 
sion com rot Any chajiges bad to be includeil in a ]jrocess 
change onl(*r that tiocumented the puq^ose of the change, 
the blocks iLffech^d, llie (h*pendcncies existing bi-'lvvecn this 
and other process change orders, and ihe testing neciled to 
Vi*rify the change. In atldition. an auhimafed mofh-l build 



.liiric" IMftT I IfwU^a-Pjk kftnl Ji n irrt.il 6^ 



)Copr. 1949-1998 Hewlett-Packard Co. 




CPU 
Emutalor 
□r Model 






■ 


Mairr 
Memory 
Emulator 






S Bcgnd' Levis 1 

dache 

Emutamr 


■ 



t/OBus 
Emylalar 



111) 



Fig, 1, i'r^t silKfHi vfTlflratlfUt nnxlplingprivi^imnnifnits. (a) npH 

niuriclJng rnvironntont,, fb) Mcniuo mu\ 1/0 cuiil.rtjljer (Ml(X') 

morlelmg t-'H vi njiirnHi [ . 

procedure was put in place U) uWnw designers to integrali^ 
theii- changes into a privale ropy oF llie mode] and verify 
them in isolalion l>erore .snbrtniliiig a jacicess efuu^ge order. 
FinaJly^ l>efoi'e a model was relejisefl to tiie verillcatioii teiuii^ 
it would undergo regres^^lon teyting to elintinale biatatil 
errors. Using llie new system resulted in a consistently stable 
model tliat accelerated the verification effort. 

Test Case Environment 

Test cases conlrol tlie stir unit apt'ln^'d to a niotleli Iherehy 
provifling die eveni iiueractions rhat sire^is die deisigiL Having 
an efllcient way for lesl cases to stress the entire ilesign is 
an inijjortant factor for improving quality. Tiie strategy used 
for the PA 7300LC* w^as largely leveraged from the successftil 
PA 7100LC effort.' It provided a simple w^ay to initialize 
machine-slate rcsomces like registers, caches, TLBs, and 
lnelno^>^ II also allowed high-hnT] rooniination of inshiit- 
tions executetl hy (he CPl' along wKh iransatiions oceuning 
at the imxlel interfaces. 

Te.st cases for the PA 73()0LC canie from I Ince sources: 
cases leveragetl from the PA ?IOQL(\ itew cases focused on 
t.lte PA 7;3( )OLC' . at 1 1 1 rai 1 1 1 o 1 1 1 1 y ge i \ t^t at e f I cases. 

Tliousands of cases that wet e wiitten to cover the PA 7100LC 
design were leveraged to run on the PA 7'JOOLC. Most cases 
neetled no modifications to lie effective because of shnilari- 
ties in the designs of the two chips. For the portions of thi^ 
PA ?'30dLC design tliat were tlifferent, new cases were pi"o- 
duced. Some of these cttses were written to focus on paitic- 
ulaJ' asj)ects of t lie design such as instruction-cache misses, 
the C'PU-MlOr interface, imd thesecoiHi-level taehe. Odier 
cases were produced using random code getierators that 
were designed to stress the FA 7:^00LC. 

Random code general oi^ are mainly eniployed Ibr jjostsili- 
con \"eritkatir^n, hut tlie PA 73d(JLf team idso emphasi^pd 
tlieir use for presilicoi\ testing. Although tdiallenges were 
encomitered, the results w ere positive. Many subtle bugs 
tliat might not have been found initil post silicon testing 
w^ere disco%eretl eaily in the design pit j cess. Random code 
generator also pro%1ded an efticient w^ay of achieving broad 



coverage* with fewer engineers than other testing methods. 
See "Random Code G ener atiotr on page 71 for more on this 
topic. 

Ckecki iig E ti vir oiiine ii t 

A modeling environment and interesting stintuli are only two 
pieces (jf the verification puzzle. The otluT critical piece is 
verifying the model's response lo stimulation. On a complex 
design like tlie PA 7300LC\ with maity designers aitt! (ens of 
thousartds of test cases, it would have het:n iniiitjssibie to 
veri^ correct model behavior without aic!s to aul ornate the 
process. As a result, a significanl part of the PA 7;5()OLC veri- 
fication effort wits spent creating software tuodules (hat auto- 
matically verified the models response to eveiu^s created by 
test cases. 

Modules were etjmpiJetl intinlie model to <'heck that the 
MIOC followed the proper I/O bus protocol and to easure 
tliat bolii the C"Pl J and the MlOf^ followed the protocol at 
the CPl-MIOC interface. Checkers were also written to en- 
sure thai the memoo^ con( roller ol)eyed proper! im in g proto 
r n! on the main met no i>' and st^tond -level i'aclte buses, 

CPU Testing. For the CPU core, wv hnked a PA-RISC archi- 
tectural simulator to run synchronously with the model to 
ensure that instructions were execuled as (he arch ite dure 
requues. When an iustrucdon finisht^i exetu(ingi (he residls 
were eomjiared i>etween (he model and the sinmlatorH A 
special mo<litle t alleti a depiiwr was wriltt^u (o ttanshrte 
internal CPC signals irUo <irchitectiu'al events That could 
I J e ( I \ ec k ed by d I e s i n I a 1 : 1 ( o r. A Her a t est case finished, (he 
I nod el's final mat liine si ale was ctanpajed against thesitim- 
I at o r s final ni ac 1 1 i n e st ate . 

New Transaction Checker Logically, the MKJC converts in- 
born ui t tan sac I i Oils on one interface to oulboiuid traitsac- 
tions on a different iiUerface. For exautplei fite CPU core 
might inilicile a cache line copyin thai the MIOC converisto 
a lead on the nienioi>^ jjotl, Wheji the nuMUory su(>plies the 
data, die MIOC retmiis the cache line t:j tlie CTU. A special 
transaction checker, called the ntetadierker, w'as WTitten to 
verify that proper transaction conversions occiuTed. Tlie 
metachet ker tnalcht^d inbound transactions with (heir asso- 
ciated outbounti transactions, Mismatchetl transactions were 
reported as failures* 

Wew Cache Checker. Tlie cache controllei^ for the PA 7:300LC 
are atnong the most complex portions of the ciesign. As a 
result, a checker w^as written to verify their o]:»eration, l( 
monitored the instruction liijjeline, the cache lead and write 
polls, and the CPU-MIt )C interface. Any incorrect behavior 
w^as d ett* c t ed a 1 1 d i e^io i1 < ' < I . 

Ad Hoc Checks. Finally, a collection of snialk ad hoc checks 
were includerl in our presilicon testhig to cover things that 
might otherwise be missed. Some were signal-lev el checks 
(for example, checking that a set of signtils were mutually 
exclusjvt^), otiiers were special chec^ks requued liy test 
cases. Some cheeked that perfornuuice features such as the 
superscalar pipehne were operating t oiTectly. 

Together, the checkers foiTued a seamless net to ensurt^ diat 
inconTct model behavior would be detectetl Tliere was 
some overlap between tlte checkers. Many (jines a design 
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flaw woultl get flaggeff by several of the checkers, but divld- 

ing the work hefweeri niultiple rberkers was an efTective 
way TO reduce the risk cif a design flaw estapiiij^ detet tioii. 
wJiile allDwiiig verifHaiion engineers to work in p;iialleK 

Modet Matches Physicaf Oesign. <h\ve !he iib\-ska} desigii aiMi 
riie \ eriluaTioii effort sfahijized, we vt^rified tbat the Venlog 
model ni^ched the physical design* Ttus was done by deri%'- 
mg a sw1t<1v-knel mod<*l from the at tiud ehip suiwork aiid 
running lliousands of tests on Lwitb it imd tiw \Vrik>g nioflel. 
romijaring key si^ials on ever>' < Iik k phase of simylaiion. 

PostsiJicon Verification 

( hwe the (Jesign is fahrieated, the naliire of the verifiratioji 
effort ehaitges eomph^tely, Tlie goals are slill the Scinie — 
to find the design bugs and eEisure that customers get the 
highest-<inality pan j>of^sibk\ but the tools and the iijJiiroacb 
are tliffeteiil . 

Test Systetns. Tlie environment for testing the design shifted 
ft f >n I software models to real c onipuler systems that in- 
r ill dec! PA 7300LC chips. We set up a number of test sys- 
tems, each of wliich couhl be contnilled remtHely from a 
host workstation using remote <1(4ingger sfjftware. Tlie re- 
mote debugger provided us witli the ability to load atid nm 
pi ogi;iitis on t be test system aiul to examine porticjtis (jf ibe 
machitie stale. It also gave us rtJinplete conirol over the 
machine without any operating system layers ohstntctlng 
our access to systen\ resources. 

Itt^cause the PA 7300LC^ i,s designed tf) w^ork in a number of 
different sysiem con flgii rat ions, we set upsysiems that bad 
different cUm k frecitiencies. t ae!u^ contlgiiratioiis, and mem- 
vny liming. To eustin^ tbal tlie <lesigii won It i work wilh a 
variety of different I/O vtwds, exercisers for ilie GSC I/O bus 
were created that could cban^ii' their behavior to mimic any 
fyi>e of I/O car<t 

Random Code Generation. Rjindom code generators me an 
efficient way t(> take advantage of the speed rtf pnstsiiiccjn 
testing. Witli a small amount t>f human eontnil, these pro- 
gnmis can create millions of unique tests to exercise every 
nuance of a complex rU\sign. We iiserl random code* genera- 
lion extensively on the PA 7*lt)(!Lt' by emphiyiiig six difh^r- 
enl gt^neratoi's, One targi^tcd llu^ lloaliugi>oint di'sign, one 
was threcledal Lbe MIOC, and four covered the entire chii> 
operation. 

Extensive Suite of Tests. We sut^iilemented the nmdom testing 
widi an extensive suite of tests using I/O exercisers to stress 
die MUH' d(^sign. Many tests were leveraged from [hjsi- 
silicoii testing of the PA TUJflLO ^md were luodiHed for the 
PA 7^J(J()Lt\ Adtlitioniil tests were written to jjrovidc t>ctter 
coverage^ especially for areas where the PA 7300 LC design 
differed from the PA TIOOLC. 

Se(f'Chet:ktng Tests. Tfie elaborate checking nu'thodolog;^ 
lioai |Jiesilicou \( lidcation wa^^ of uo use iu p(jslsili{Oii iesl- 
ing because it was not possible f<ir th<^ cht^ckhig sfifiware to 
obserxHMlje design now embedded on a WJ^l i hip rumiing in 
a system. To (■omi>ensate* all of the postsihcon tests were 
self-iiiet king. The generaloi^* thai ereate^l the random tests 
also ensuri^d ihat tlu^chip respoudt»d [Jiopioly to ibem. 



Random Code Generation 

The complexity of processor designs has increased dmmatn^lJv m an 
effort to imi^jve pedonnance, reiluce systein cost, and allow processofs 
to be used m frme system conhguraiions. Ttie incieasmg comptexttv 
makes it almost impossible to identify the specific event cross-products 
that need to be tested to ensure that a design is conecL Random ca<te 
generation Is an effective method lot testing a design withoul having to 
identify exactly what needs to be tested A random code generaior 
cteaies legal random sequences of machme states and mstryctjons tfiat 
exercise a design more ihoroughly than application software 

The term random is somewhat misleading — generatmg completely ran- 
dom machine states and instruciions would result in uninterestirrg tests 
as far as stressing the design is concerned. Instead, generators focus on 
key aspects of the design while preserving an element of randomness. 
Accelerating rare events, hitting boundary conditions, and concentrating 
on instrucitons that exercise complex parts of the design are among the 
ways to focus a generator The probabilisiic distnbution of random num- 
bers creates interesting combinations of these focused events 

Although random code generation has higher coverage in posisilicon 
lesimg where the design can be tested at high speeds, it can also be 
effective in presilicon testing. When running on relatively slow presiiicon 
models, the effectiveness can be improved by adding more elaborate 
checking strategies and focusing the generators on smaller portions rif 
the design 

Some elements of a quality random code generaior include: 

• Coverage of the entire design 

• Fociis on complGK portions of the design 

• Low fault latency lie., a failure gets noticed soon after it occurs] 

• Reproducible test cases 

• Aids for debuggmg failing tests. 

Random testing techniques can also be applied to designs other than 
microprDcessors Memory nr I/O controllers can use these techniques to 
randomly generate machine state and transactions that will stress the 
conirDllers Designing special -purpose bus exercisers that are control led 
by rat^dom test generators can extend such testing mio the postsilicon 
environmeni. 



System Test A final element of die post.silican testing was 
verifying that operating syslenris and a[>i)lieati(HT iJi'ogrtUiLs 
ran pnit^'tlv *>it eoinpitter systems t)nilt around the 
PA TniKll.C, A large itniount olteslini^ vv^ls dune by several 
dirfert^id rjrganii^ali<nis within tiPatifl ituludetl r^petviting 
sysleni tf liability Wsis, Ijeruiiniaik tn'ogranis, aJtci key user 
applit^ationB. 

Verification Results 

The FA 7=3O0LC veritieation work was a success. Presiiicon 
t(^stinj^ €4iniinated r>ver 800 desii^n bugs, and more tban 
12tlO process rhaiige catiers were added trj I he nuxh^l in one 
year. Tlie iinnlity of ilu* llrsi revision of t hi m hip wu-s very 
high, Only eight ruticiXotuil bti^s were found in postsiliron 
testing, or these, only one aTfeded otir tiesign part tiers, and 
it fiad a siniijle vvorkaroiinil. The nP-l'X o|)eraliitj* system 
WiLS boott^d shortly alter t Irst ri\ ision parts arrived, t hir 
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IHistsilicon testing w£is far more extensive than what we had 
previously done witli the PA TKKILC or its predecessors. 
The verification efftnl enstired that the PA 73U0LC will main- 
tain HP's reputation for ([ludity processors. 

Debug Support 

The high level ofiniegration on the FA 73001, C rechice^ tlie 
visibility into chip ttperalion Mtnl aids in dehngging [irofo^ 
type sili(7jn. In [larf icujan moving the prim;n> rrK lies on(o 
the eliip removed a valuable sfJiaee of tlebiig data wliile also 
introducing a new source of potential functional attd elec^tri- 
caJ problems. 

Since the MK )(", float ing-poinl coprocessor, imd TLB iAic also 
cfintahied on I he same die, the only external pads visible to 
del) aggers were for the UO bus and the nienioiy interface. At 
the same time, ttie PA 7300 LC had neiA challenges such as a 
large primary on-chip cache, a new IC process, liigher oper- 
aliiig frequencies, mid a second-level cache. Delmg suj>poi"l 
was impoitmit to Lniprove the signal visiliility atid to reihice 
the risks associated with tlie new technology. 

Debug Mechanisms 

Signal visibility is of primary' importance whf^n rlehugging 
a faihire, scj several let hniques were used to nmkv iiHernal 
signals accessible. 

i idle r^'clcs on the GSC I/O bus wen* iiserl to di ive dt^biig 
hifonnation. 

> Seventi^en special chip pads are dedicaied to drivhtg real- 
time debug infonnation. To redaf e cost, these pads are not 
bonded in production parts. 

» Thorougli im]jlemenlation of IEEE 1149,1 and sample-on- 
Ihc-fly {a scan Uxlinique invented for the PA TIOOLC )^ -^ 
allowed a ver^' broad, but only one-cycle-deep, snapshot of 
the chip state to be reporteti, ( 'nstom data caplnre tiaidwarc 
was designed to galher the delaig traces and present them 
to a logic analyzer. 

New Pattern Mapping Faibire Isolation Teehpxque 

Tr ac es cai ) t u ret I b im i tl km 1 el ) ug p t )i1 s can 1 1 ( ^ o verwl le I min g 
in size, making h difneiill to isohne I he failure. The PA 
7300LC addiessed I his pjoblein by InipleiniMUing circuits to 
recognize internal ch\[} state ijat terns. Tlie patterns are pro- 
giannned from softwaie using special itisliiTctions iniple- 
mented on the PA 7300LC, and the catiture of tlebug traces 
cmi i>e (iret heated on a state pattern matcli. f)ebng traces are 
thus shoitened to cUi interesting region, h is also [jossifjle lo 
alter the program flow upon a patt erii mat ch^ allowing a 
branch to chagnostic softw'aie to probe for a faihire. By 
providing a flexible scheme for programming re pea table 
pall ems, the task of isolating a failm-e and pertoiTtiing ex- 
periments to determine its root cause was greatly sinipliiied. 

Target Applications For Debug 

Functional and elect heal verification were the primary' 
applications for which the debug circuitry^ was tiesigned. but 
the debug features were general enough rhat Lhey couid be 
used to diagnose processor problems encountered during 



bringing up the operating system, t^nnwaie development, 
imd benchmarking. 

Electrical verificatifHi relies more extensively on deliug haj-d- 
ware becatise failures cannol l>e re])rnduced in om- sotlware 
model of tht^ ( Pf, Kngineeni; vvf>rkiug to verify a chip's f her- 
nial antl eletnrical margins use del)ug features to investigate 
and understand failures ncrni ling at extreme operating 
points. 

Debug Features 

'tlie 1*A 7;:»CHIU' debug features iire intended to work in any 
en\1r(>mnent iisefi to test tJie C¥V — wafer test, package test, 
and system rest. The debug features are nperalile and poria- 
ble across these en\ir<mments. hi addition, tiel)ug circuits 
were riesigned to tighter sjiecitl carious tban the rest of the 
r*A 7300LC. This ensured \ivM they bmctioned properly well 
into the operating [egioris wheie ilie ( Pl^ core is expected 
(o fail We achieved this thnuigb (he use of simple logic and 
c oj isei vat ive timing bu dge ts . 

j\lthougb no major problems were fomxd during qualification 
of tlie PA 7:^3O0LC, debug features were rehed upon to help 
fix the i"jroblems that arose, helj^ing us to achieve quick time 
to mark el for PA 7;J!)0LL'-bascd systems. 

Conclusion 

The extensive verification f>f ttie PA 7:i00LC' design was 
b;ised on the successful strategy u.sed for the PA 7lOOLr. 
Imjnovernents were made hi the na>del building jirocess rind 
in the expensive use of random code gei'teration in 1 he presi- 
licon and posts! lie on phases. Many features were added to 
the PA 7300LC' design to allow efficient debugging of post- 
sihcon failures. Together, these efforis ensure that customers 
get the highest qu;:ility part possible, 

Ac kii o w I ed gm e n ts 

Many peor^h' contriliuted to the verification effort of the 
PA 730bLC including members of engineering systems labo- 
ratoiy in Fori Collins and the bitegrated Circuits Business 
Division at the Fort CoUms tlesign center. Key contributions 
weix^ also made by indi^itluals from the Ffiri ('ollins systems 
1 a b o rat o ry . ih e co j ni ) a I e [' I ecb n o i ( jg^y I ai )o rat o !>^ in C uper- 
!in<>. and the UNIX"^ development laboratory in Fort Collins. 
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An Entry-Level Server with Multiple 
Performance Points 

To address the very intense, high-volume environment of departmental 
and branch computing, the system design for the D-class server was 
made flexible enough to offer many price and performance features at 
its introduction and still allow new features and upgrades to be added 
quickly 

by Llii A> Nease, Kirk M. Bresniker, Charles J. Zaeky, Michael J, Greenside, and Alisa Sandoval 



As ihv (^onipiitpr iiidiisf ry conliniios ttj in at ore. system sup- 
pliers will rontimio lo find more* ( reativt^ ways to meet 
growing ciislonier expectiitioiis. Tlie IIP 9000 Series 800 
D-cUlss sender, a new low-enci system phitfonn fram riR 
ivjjn^senlsa ra*iically difTerenl apijroach lo system flesij^n 
than any of its pretk'tes.sors (see Fig, 1). 

The Series 800 D-c^iass sender comes at a time when sen,' er 
systems priced below" V.S. $20,000 are at a crossroads. 
Commodity tecknologies me ujjsizing, enf eri>rise ctisfomei's 
aie eiyijyijig choices otVtnodiict fmiiilies that offer Ihnusands 
(jfapiilications, open romiuitiiigand netwurking have l)lnned 
die dlslincljons between conipetihn^' ulleriiigs, iind njially, 
indirect marketing and integration channeLs cm\ offer fom- 
peUins value liuiidles that were once the exclusive domain 
ofhig. direti iiiarkel iiig conijjuter sysfeni suppliers. Thes*' 
ll^^nd^> have (realed an f^nvironmeni of very intensi^ high- 
volume coittpelition for contn>l t?f depannientiit iutil irrandi 
C'ompuiing. 

To address this environment, tlie system design for tlie D- 
class sener had \o be Fli*xit)le enough to offer many price 
and perffKinanci" featuren at its introdm-tiun and still allow 
the addition of new featitres and iii>grades lo tlie system 
tjuickly. The server's cotnjietitive s[>aee. lieing t>road and 
heterogeneous, also tiemanded that the system be able to 
accr)inmo<lale leelintilfigies originally ttesigned tor oiher 
products, including teehnf>logies front sysienks that cost 
more than the iK^lass s?en ei-. 

System PartitUiiiing Design 

In designing the iHlassenriy-ievel s<^n'ers, cjne of tin* pri- 
niai>' goals was to c^reate a new family of servers that ctjuid 
be intrciduced with multiple performance points without 
any investment in new \TjRI ASIC's. The signers also had 
to hi' 4*apalile of snjjjHUtlng new advam es in juocessors 
and memory mui l/Q subsystems wilij minimal system 
reengineering. 

The Hen'er family would cover a siisus of perlormame points 
that hart previously been covered by sev<*ral classes of send- 
ers whith, while they were all lnnm>- conipatibh^ with five 
PAdJlSC ;n'chiiectui'e, had vei^ diOerent physical implemen- 
f at ions. The Ir^vsci -perl oruia a r<' point designs would he 



di'awn from nniproces^or-cmly PA TIOOLC-based E-class 
systems. The upperijerfomiance-iioint designs would be 
drawn from the one-to-four-way muhija-ocessor PA 72(K)- 
b^Lsed K-cUiss sjiitenis. The physical designs tjf tht^se systems 
varied widely in nuuty ;ispect.s. Issues such as 5,0V \ersiLs 
:jJ.:3V iogir, a single sysfem clock versus separate clocks 
foi' l/D and processors, atKi vvlietlTer I/O devices would be 
local <^d on the t>rocessor memoiy bus or on a separate L/D 
bns had to be resolvtMt tiefore the existing designs could be 
re] unlit ioncil into com| j/iI ible physical and logical subsys- 
tems. Tallies I and II list the key performance points for the 
IIP fH)0(J E-c:lass, K-class. and D-chiss seiTers. 




Fig, L The I)-c:l;iss Hen^r svsltaii tal>aH't, 'fhc tliiiiensiun.s nf this 

f:fi|jiluM arc in.2 in f2ti nn) widf, 2H,8 in ffiU,-! nu ) high, and 22.2 in 
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Table I 

Key PeHormance Pomts for HP 900C K- and E-Class Servers 



Model 


Processors 


Clock Speed 

(MHz) 


Multiprocessor 
Configuration 


Memory 
(M Bytes) 


Cache 
(1/D K Bytes) 


HP-PB 
I/O Slots 


HP-HSC 
I/O Slots 


Disk Capacity 

(G Bytes) 


K2x0 


PA 7200 
PA 8000 


120 
160 


i-to^'Way 
1-to^-Way 


64 to 2048 
128to204« 


64 to 256 
256 


4 
4 


1 
1 


3800 
3800 


K4xO 


PA 7200 
PA BOOO 


120 
160 to ISO 


14.0-4- Way 
l-to-4-Way 


128l:o:i840 
128 to 40% 


256 to 1024 
1000 


8 
5 


8 

D 


8300 
8300 


Ex5 


PA 7100LC 


48 to 96 


1-Way 


16 to 512 


64 to 1024 


4 


4 


1^ 





Table II 

Key PerfDrmance Points for HP 9000 D-Class Servers* 







Clock Speed 


Multiprocessor 


Memory 


Model 


Processors 


(MHz) 


Configuration 


(M Bytes) 


D2x0 


PA 7100LC 


75 to 100 


l-Way 


32 to 512 




PA 7300LC 


132 to 160 


1-Way 


32 to 1024 




PA 7200 


100 to 120 


l-to^2^Way 


32 to 1536 




PA 8000 


160 


l-to-2-Way 


64 to 1536 


D3xO 


PA 7100LC 


100 


1-Way 


32 to 512 




PA 7300LC 


132 to IfiO 


1-W^ay 


32 to 1024 




PA 7200 


100 to 120 


l-ro-2-Way 


32 to 1536 




PA80O0 


160 


i-to-2-Way 


64 to 1536 



Cache 
(1/D K Bytas) 

256 

256 to 1024 
512 

256 
64 to 256 ** 
256 to 1024 
512 



HP-HSC 
1/0 Slots 

4 

4 
4 

4 
4 
5 
5 



EISA 
I/O Slots 

4 
4 
4 
4 

7 
7 
7 
7 



* HP-PB sfDtg = and disk capactly = 5 Tbytes. 

* 1 M byias- of second- level cache. 

Additional constraints on the design were a direct result of 
competitive pressures. As tlie presence of Industry Standard 
Arcfiitecture-based systems has grown in the entr>^-lcvel 
serv^er space, the featuies tliey offei' became D-t lass require- 
ments. Tliese requirements include suppon for BIS A 
(Extendefi Indusli^ Stand^ird Architectiue) PC) cards and an 
increase in the standard warranty period to one year. Both 
of these requirements were new to the Series 800. Also new 
to the Scries 800 was the desire to design a system enabled 
for distribution through the same type of independent dis- 
tiibution channels used by other server vendors. Add to 
these constraints the cost sensitivity of products in this 
price range. an<l we have a system that uses as many itkIus- 
try^-standard coinpoiienis as possible, is extremeiy reliable, 
mid is capable of being assenibied by distributors, all with- 
out compromising aJ\v perfonnance benefits of current or 
liiture PA-RISC' processors. 

Feature List, The first step in the process of partitioning the 
system was 1 o detail all possible features that jnight be 
desired in an entry-level sei-ver. Tliis list was compileti by 
pulling featuies from oiu" development parttvers' reqiiiren^ents 
analysis and from know ledge of oiu: competitors' systems. 
Once tins feature list was developed, each featiure was evalu- 
ated against all of oiu- design goals (see Pig. 2). Each featiu-e 
was then r;inked in terms of its relative need (must, Mgh 
want, want) and technical difficulty (high, nxedrnm, low), 
Detennining the possible feature list was the first goal of the 
partitioning process; the hst w-as continiially updated during 
the entire process* 

Once the initial feahne list was created, a small design team 
consisting of a mechajiical engineer, an electrical engineer, a 
fimiware enguieer, a si^^stem architectj and a system manager 



began analyzuig the list to see how each feature would affect 
the physical partitioning of the system. The goal of this 
process was to generate a fully pariitioned mock-up of the 
physical system. Successive passes through the featuxe list 
led to successive generations of possible desigr^Sn With each 
generation, the hst w^as reevaluated to determine w^hicli fea- 
tures cotdd be achieved and which features coxild not. 

Physical Partition. After the fust few generations it became 
clear that a few" critical features W'Ould drive tfie overall 
physical part it ion mg. The physical dimensions would be 
determined by the dimensions of a few^ key subsystems: the 
disk atray and removable media, the integrated PO connec- 
tors, the PO card cage, and the pow-er supply (see Fig. 3). 
All of these components were l\ighly leveraged from existing 
designs, like the hot-swap disk aiTay module developed by 
HPs DLsk Memory Division, and the industry-standard 



Feature 


need 


Difficulty 


Frpnl pane! display 


Must 


Low 


HP 100LX Palmtop fronlpanel dbptay 


Want 


High 


TwO'lJne LCD display 


Higfi Want 


Medium 


16-characier display 


Must 


Low 


Backlit displav 


Wafii 


Medium 


Hexadedmal status infannation 


Must 


Low 


Eaglish status information 


High Want 


Medium 


Lscalfzed language status infurmaticin 


Want 


Htgh 


Display suspected lailure causes aHst faults 


Must 


Medium 


Include power- on LE£} ipiticator 


Must 


Low 


Include disk-access LED indicator 


Want 


Medium 


Include blinking 'Boot in progress" indkator 


Want 


Medium 


Heset switch 


High Want 


Medium 


Reset switch with hey and lock 


Want 


Hifh 



Fig, 2. The feature list for the D-class servier's front-pajiel display. 
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Eight I/O Slots 



PrtMres^ot Cirri tr 



Ningitft Door 



Power Supply 




Powtr Switch Mourn 
Dtspfsy Moum 



"^-'^..,^ Airflow 



Mint-Fle^xible Disc 



Removable Media Slots 
(Two 1/2- inch -High Devices) 



Oisix Cavity 

IHot Sw&p Optional^ 



Fig. S. The chassis for the Series 8013 D-class sender The features that detennlned the overall size of ttiB chassis wem the disk cavity, the 
removablcf media slots, the power supply, I/O slots, and processors. 



fonii-factor power supply. The first mfyor design conflict 
arose when we realized that tJiese components cotikt not be 
integrated in a package short enougli to fit under a sf atichird 
office deskt and yet narrow enough to allow two tuiils to be 
racked next to each other in a standard rack. Numerous 
attempts to resolve these two confliering dematwls only 
succeeded in creating a system that would violate our c*o.st 
goals or require more new invention than our schedule 
wotdd ailow. 

hi the end, it was detennitied tJiai ttiedesk'tieii^ht require- 
ment and cost goals were more important than the rack 
requirement, so tJie package wasslionened and widened ro 
accommodate all the critical component-s in a package that 
would tit tinder a standajd office flesk. Once this w^as de- 
cided, the system mock-up came together tjuickly; and the 
second goal was reachetl. The system partiii*>niug shown 
hi Ftg. 3 provides se\T!al benefits necessary to achieve our 
goals. The standard PC-type power supply helped us to 
achieve new lows in cost per watt of power. Tlie division of 
the box into the lower core I/O and disk array volume and 
the upper exp^msion I/O slot and processor area helped to 
simplify the design of the forced-air cooling system since it 
separates the large interior volume of the box into two more 
manageabh* rt^gions. 

Printed Circuit Assemblies. The next goal in the process was to 
rejiaitition iind iniegraie The disparate design sources* irUo 
logically and physically compatible printefl cinuii jLssembhcs, 
while maintaining ail of our design constraints on t^ost, 
expandability; mnl design for distribution. Agdn, a single 
cmcial design declsi4>n helped t*j riuickly t>^ul illon tlie system: 

' The design sources were pans from E-class ami K^class sarv&ru arid the J-cltss warkatations 
thilwere cQmbmetJ to farm the D-:lassssrvers. 



the D'Class would not use any higli-speed, inipedance-con- 
IroUed connectore- This decision was made as a direct result 
of the K-class development process aiid the success of the 
Series 80€ G/IM-class Model 60 and 71) systems. The K-class 
development process showed thai although high-speed im- 
pe<imice-(^ontrollefl connectors can add excellent flexibility 
and exi^andability to midr;inge sysiems, tiiey require a great 
deal of mechanical aiid manufactimng infrastructure. 

The processor modules for the G/Wl<\jdss Models (>0 and 70 
are the uniprocessor and dttal-processor versions of the 
same board (see Table 111). Tire same printed circuit board Is 
loaded witli either one or iwo prtjcessors at the time of man- 
ufiicture. To increase tJu^ numt)er of ;>rocessors in a systent, 
the entire processor module mitst be replaced with a new 
printed circuit assembly. Olher systems^ hke the K-class 
seivers, allow for the uicremental increase of the number of 
processors hi the system with just the addition of new pro- 
cessor modules. Even thougli tlie board swap is a less desir- 
able upgrade path tlian an incremental upgrade, the success 
of the Models 60 and 70 systems led us to believe that it was 
quite acceptable to our castomers. 

This decision simplified repartitioning the design sources, 
since it meant that the high-speed processor and memoir 
clock domains Euid their data patlis could remain on a single 
printed circuit assembly, while tlie moderate-speed I/O do- 
main and its data paths could cross multiple piintetl circuit 
assemblies. It wi\s determined that both the dual 1/0 bus 
architecture of the K<-lass and the single 1/U bus of the E- 
class would he supported in the system. To do 1 his, the con- 
nector tec'hnology used in the D-class is modular, allowing 
the dt^sigiis If) load only those portions of the connector that 
are siiptJculiMk 



Mm 1 1W7 I U" wlott-Packarri Jotimal 75 



)Copr. 1949-1998 Hewlett-Packard Co. 



Table III 

Key Performance Points lor the HP 9000 Series 800 
G/H/Uclass Model 60 and 70 Systems'" 



Model 


Multi- 
processor 
Configuration 


Memory 
(M Bytes) 


HP-PB 
I/O Slots 


Disk 
Catiacity 
(G Bytes) 


GOO 
(170 


2-Way 


32 10 708 
32lo708 


4 
4 


150 
150 


urn 

H70 


I-Way 
2-Way 


32 to 768 
32 to 708 


S 
8 


30{] 
300 


170 


l-\Vay 
2-Way 


04 to 708 
m to 70S 


12 
12 


330 
3^0 



• Processor- PA 70DC, cJocJ* speed = % MHi. Caciie per CPU (InsiFuctJon/DaisI = 1 M bv^e/" Pv' 
byiB, and HP-HSC slots - Q 

This lowers botli tlit- material aiifi ass(>nibly costs. To iuHher 
lower the matenal cost, liie I 'A 710()LC-baset] processor mnl 
memory module is ral>iii^£iU^(l «m a smaller primed etreujl 
board thrni the PA T200-[)a.s(Hl jjroceasor module. The dlfler- 
eiice in tlie size of the modules is accommodated by att.a€h- 
ing them to a sheet -metal carrier that adapts the modules to 
a common set r^f t ard guides. Not only is the slieet nieial 
cheaper Than I he corres])fjiKiiii^ ]uirted tin iiif luaterial 
would have been, but i( is also stronger and easier to hiseit 
atid remove. 

A Becondar>' benefit of this strategj^ is that it allows new 
investment to be irtac!e as needed. Historically, 1/0 subsys- 
tems and 1echnolog\' are much loiiger-li\ ed thmi processor 
iind memoi-y tecbnologies, The i>artitioning strategy we used 
helped to decouple the Ui) subsystem from the processor 
and memoi-y. As long as they remain consistent with the 
deiiued interface, ijrocessoi' intxhiles are free to exploit any 



technology or adapt imy <iesign desuiible, TliLs also enables 
l)-class servers lo excel in meeting a new and growhig re- 
rjiiirenient — design \or reuse. A customer is able to upgrade 
through many lunrormaiu e [joints simply by ch^mging pro- 
eess(n" modules. As sojue comi tries are in\esligatitig forcitig 
matiufaeturers to accept and recycle old equipmenl. keeping 
the return stream as small as possible is highly desirable. 

Once the printed rireuit assembly boarcj fjut lines were com- 
plete, the process of adapting the various (iesign sources 
to the new pmlitioning was time-consuming, but relatively 
straight for wai'd. Fig. 4 shows die the vaiious design .sources 
that weie pulletl together to form the PA 7200-based proces- 
sor module. As fioiiious of designs were uicigfMt. altered, and 
recombincth the possiLnlity of traiLst ri[>1ion iuiomgiew-. The 
original designs were executed by three different labs and 
many different design teams. All (iesigus were fully func- 
tional as designed, tmt we were extending designs as w^ell as 
integrating them. lu an effort to minimize tlu^ possibility of 
errors being hitroduced during die adaptation process, tlie 
schematic interconnect list was extracted and trajislated 
into a simulation model. Tliis Tnf>del was then added to the 
motlels used to \'erify the tuigiiuil tlesigns Lo ensia^e thai no 
new errors had been iiitroduceff 

System Partitioning and Firmware Design 

Because td'the pari ilioning scheme used for the D -el ass 
enliy -level servers, the finuwart^ dt*sign was a critical factor 
in achievhig the ov<:*rcill program (ibjective of low cost. The 
tumwai e design a^idr<^ssetl cost issues in support of the majt- 
aiacturhig prtjcess, Held sti]jpoi1. mid the upgrade strategy. 
In addinrm. ah hough tite und(u1ying hardw^me is drmnati- 
cally <ii[lerent deperuliiig upon whicli ijrocessor module is 
installed in the system, from the customer's perspective, the 
external beha\ior of each [lerfonuance point should be the 



PA 7200 l\Aaduk 



K-Ctass Memory Extender 




K-Class Matl>aii)03fd 



Fig. 4. The vanotiH dt^ign source-s 
t hat. were pulled ttigelher to form 
the PA 7200 LNriass module. 
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same. For the firmware desi^ team, thai meaiit that regard- 
less of the underljing hardware, the entire D-class had to 
have rJie kK>k and feel of a single produa. 

D- CI ass Subsystems. From a firmware perspective, the D-cls^s 

is partitionecJ inio t^^o subsj-stems: the system board and tlie 
processor modules (see Fig. 5). The system board rontaitis 
aU of the I/O residing on or hanging off the HSC (lii^^peed 
system connect ) bus. This includes optional I/O modules 
that plug into the HP-HSC slots, such as the fast-nide SCSI 
and graphics cards. It includes core Lj^O built into the system 
board, which provides ^rial interface-s. single-ended SCSI, 
Ethernet LAS. a parallel interface, a mouse and keyboard 
interface, and a flexible-disk interface. The EISA bus, which 
is connected at one end to the HSC bus, is also foimd on fhe 
system board. The Access Port/'MUX card/ which contains 
its own HSC-to-HP~PB I/O bus converter, also plugs into an 
HSC slot. In addition to these I/O buses and devices, there 
is an BEPROM and two hardware dependent registers tiiat 
hold I/O configuration information. This nonvolatile men^oiy 
and the configuration registers are critical to the partition- 
ing and upgrade strateg\^ for the D-class seiner. 

■ HP Access Port is a tool ior pravtding r&mote suppait for HP sflivers, 



The core of each processor module houses the CPU, instruc- 
tion and data caches, and memor> subsj^tems. .^Iso on the 
processor board is an EEPROM and two more hardware 
dependent registers. The PA 7100LC uses the HSC as its 
native bus, so its connection to the system board is rela- 
ti\-eiy straightforward. Bowe\'er, the PA 7200 requires a bus 
converter between its tmtwe bus and the liP-HSC bus. Thus, 
to ha\'e one system board contmon betwetm the two proces- 
sor modules, the PA 7200 processor board was burdened 
wxtit carr>dng the bus conv^erter circuitr>\ One other signifi- 
cant dilTerence exiscs between the two processor modules: 
scratch RAM.** Tile inclusion of 32K liytes of static RAM on 
tile PA 7200 module meant that system variables and a stack 
could be set up vet>^ early in the boot process. The lack of 
this scratch RAM on the PA 7100LC limited the amomit of 
code that could be common between tlie two platforms. 

Consistent Look and Feel. Tlie goal of having the same look 
and feel for the entire product line could be met by having 
one common co<ie base for all performance points, but 
because of the signifjcimt hardware differences mentioned 
above, this was not possible. The primary differences lie in 

' Hie PA 73D0LC and the PA 8ffl30 aJso inclcjde scratcti RA/vl 



Process pr 
Modules 



PA 7200 Boards 



PASfWOBoarits 



PA 710fM.C Board 



Combined 
UfidP 
Cache 



r 



PA 73(K]LC Board 




Exicrrnfil 

iMBylfi 

Cache 

lOiitiofial) 




Cofitbined 
1 and D 
Cache 




* 









HSCO 



T 






1 und □ 
Caclt^ 

■1 


PAKXN} 


t 


1 


Memory H 


Bus 
Convefter 



KSCO HSCO H5C1 

HSCO HSCt 
System Boiftf 



HSCO HSC1 




Access 



HSCO 
Slots 



L 



Tarbij HP-HSC 




EISA 
SbH 



Fig. 5. A firmware perspective of the D-class subsysfems. 
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the processors themselves. Tliere is no commonality in the 
control antj sf af us n^^isters of the two proeessors, raeiies 
me aceessefl tiilTerently. m\(\ the aieinoi>' sufisysierns are 
tfjo dilTereiiC in slime code. These (iirJerencc^s, aion,t^ with 
otiier hartlwaie ineojiipatibilities, meant tlial each processor 
niockile nt>eded l:o have lis own sepai'ate iuid distinct code 
iiase. However, f)eeanse die priiiinr>' tlifrenMicc^s ht^tween 
tile two PA THHJLC jjHM t'ssor versi<jns (7ri MHz or ItM) Mil/) 
and the two PA 72f}() processor ni<ntnh^s are proc*essor speed 
and cache size, all performance poinis that use a conimon 
CPli can he supported l)y one conuiion code ba,se. 

Tlie three areas needed to give the ]jroduct line a consistent 
look and feel hiclnded a eon in ion feature set, similai- strate- 
gies for han<1iing mid reinmiug eri'ors, inul a conimon user 
interface. To ensure consistency in this regard, one engineer 
was given rest>onsibiliiy for the same functional area of each 
platfomi. For example, the enginc^er who worked on niemrjry/ 
cocie for tiie t*A 7IOO[.C also hud responsilnlity for the mem- 
oiy code on the PA 72tK) platform. Taking advantage of this 
synergy paid ol T especially well in the design and implenien- 
tation of ti\e user interface, where differences between plat- 
foinis could easily lead to confusion. 

System Configuration. I'mtitioning the KEPROMs between the 
processor tjoaiil imd the system hoard is u key e^nahUT of th*^ 
upgratle strategy. Since an upgrarle ctjjisists of leplacing the 
processor module, I/O configuration infcjniiation mast remain 
with the system bojird. The EISA configuration, graphics 
nionitor ty]>es, and LAN MA(' address me stored on the sys- 
tem board, Atidiliona! contnd information is used to check 
for consistency between the two EEPROMs. The firmware 
exf>e€tB the foniiat of llie system board EEPROM to he the 
same, regardles.s of which processor module is installed. 
With all I/O coufimiraf ion infoiTiiation and contiol vaiiables 
in the same location and stiaring die same set of values, pro- 
cessor nioduies ran be fteely swapper! without chmigingthe 
I/O configuration. 

Dyiiaiuic^ cunfiguration of the system is used to support die 
upgrade stiategv, the jti arm facta ring pro cess, imd field sup- 
port. When a D-cla.s.s server is [xiwered on, stale variables in 
the pixjcessor nicKlules and system btr^trirs ii(.)n\ olatile mem- 
ories are tested for a valine that indicates wlietliei^ or not they 
have been initialized and configure<i. If they fad Itus test 
(whicit is always the ci:is(^ for initial turn-on during tlic m;iii- 
iifaclurirtg l^iocess) the system's hai'dwm'e contlgu ration is 
analyzed and the corresponding state mid control variahles 
are set. Mucli of this information is available via ttie hard- 
wai^e de|>endem registers located on each boaixL Tlie jiroces- 
sra^ frequency^ system board t>pe, tuid other details concern- 
uig the I/O cordiguratifm can lie rc^ad from these registers. 

The stat^ variables, which aje set as a result of examining 
the hmdwarv configuration, include the system's niodel 
idenrifier (e.g., HP900d/SSl 1/D310), hardware version 
(EnrERSION), and patlLs to tlie boot mid console de\ices. 
Hie boot path can be either the built-in single-ended SCSI 
device or a hoi-swappable fast-wide SC-SI device. The tlmi- 
ware checks for the presence of a hot-swappable device 
ivluch, if present, becomes the default l>oot [jath, Othei'wise 
a single-ended SCSI device is configured as die default boot 
[jadi- The aclual hard\^m'e configuration is idso examinetl to 
select mi appropriate console path. The default console path 
can be either the built-ui serial port, the HSO Bus Access 



Porl/MLIX card, or a graphics console. Depending upon the 
presence of these devices and their configurations (e.g., a 
graphics device must also have a keyboard! att ached )> a con- 
sole path is selt^ct«*d according 1t> rules worked out in coop- 
eration widi the manufacturing and suppoit organizat ions- 

The same sequence ol' events occurs when upgrading or 
replacing a processor module. In iMs ease^ the system board 
IS already utitialized and only the processor module requires 
conliguralion. On eveiy tioot, iufoiTtralion srrch as the model 
identiller is ehec ked agairrsi the actual hardware confrgura- 
tion and any mismatch will invoke the appropriate c onfigu- 
ration actions. Likewise, because some information is kejit 
re d 111 T d an 1 1 y be\ wei^i i d i e | > n k ess* ) r m od u I e an r 1 1 h e system 
boaiTl, they can lie checked fta a misinaiciL Tliis rcdiwKlancy 
niemis that the sy stern board can tilso be re] j laced in the 
field with a minimal amount of manual reconfiguration. 
Because a I>-class ser\"er can consist of any combination 
ot two syshNii hoards and several different processor mod- 
ules, atirl betiuise further entrance men ts will double the 
nimiber of processor modules and include two liew CPUs, 
dynamic configuration has obviates! the expense of develop- 
ing ex I er rial configuration tools, reduced the conij>lexity of 
tlie Muuud'acturing process, and suuphfied field repairs and 
upgrades. 

System Packaging 

Mci'lianit at packagiirg is one^ of the key vai'ial)les in mauitain- 
1.11 g a competitive edge in the scnver market. The challenges 
involved in the system package rlesign for the IIP 1)000 D- 
c I as s nerve i\nv\ u< 1 e d i r k1 ustri al d esign , n lamif ac t ural ) i 1 ity, 
KM! containment, thermal cooling, and acoustics, while 
having \\\e design focus on low cost. 

The D-class low-cost model was based on the high -vol rime 
jjersonal cxjm]juter mai ket. However, unlike personal com- 
puters, serv'cr products must support multiple configurations 
witli an etusy upgrade path and high availability. This meant 
that the t)-cliiss package design had to be a highly \'ersatile, 
veiliCfii tower wilh tlie alii lit y to be rack-ntounled in astaii- 
darfl ElA rack (see Pig. 6). It allows for multiple processors 
mid power supplies, mid can support up to eight i/O slots for 
EISA imd GSC cards. The desigi^ alsr> su[>i>orls H[i to n\'e liot- 
swapiiable or two siiigle-enrleil disk drives and two single- 
ended removable metUa devices with one IfJlj (Integratecl 
Device Electronics) mim-Oexible disk (Fig. 7). This diversifi- 
cation ijrf)\i(.ies the entiy-level customer with a witle nmge 
<.ifconfigu rations at various rnice/performmice jjouits. 

Manufactnnng and Field Engineering Support. Concurrent engi- 
i\eering was a key comriliutor to I he design for assembly 
(DEA) and design for mmiufacturrng (DFM) successes of the 
D-class server. Since we are ver>'' cilsI omer forused, we take 
the disassembly and repair of the unit jirst as seriously as tJie 
manufactiirabihty of the product. The D-class niecharrical 
lean^ worked closely \\ith key iiartiiei's throughout the pro- 
grmn to ensure the following assembly antl manufacturing 
features: 

• Asiugle-btiild orientation (common assemblies) 

• xMultiple snap-iu features 

• Slotted T-15 Torx fasteners (Toi^x f listeners aie used for 
IIP numufacturing, mid the slot is for customei-s and field 
engineers. ) 

• System board tliat slides into the chtLssis 
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Door lock 




Fig, 6. A front and back vie^srof the D-dass server. 



* Quick access to all components 

• Maiiuf actLiriiig line for high-volume production. 

EMI. Design for EMI cont^nment was a considerable chal- 
let^ge for the D-ciass sender program. Tlie package goal was 
to contain clock rates up to 200 Mllz. This required a robust 
system design ami two new designs for the EISA bidkheads 
and core 1/0 gaskets. 

The system EMI design is based on a riveted sheet-metal 
chassis using a sIot-Emd-tab methodology for optimmu nian- 
ufacturability, A cosmetic outer cover witli a hmged door 
completes the EMI structure. EMI is contained using contin- 
uous seams and EMI gaskets with small hole patterns for 
airflow. 

The EISA bulkhead gaskets required a new EMI design. The 
new^ design is a slotted pyramid that forms a lateral spring 
element with a low deflection force but a iiigh contact force 



Power Switch 
Cover 



Displav 

Mini-RexJble Disk 
Removable Media 

Hqi Swap Disks (59 




Fig. 7. TJie rj-t'la.^s rablnp! wilh l.in' iUmr removed. 



(see Fig. B). The new design for the I/O gasket includes a 
foam core wrapped with a riickel-over-copper fabrie, which 
provide.s a SOO-tlegree contact around each comiector (see 
Fig. 9)> These new ciesign.s produced excellent results. 

Thermal Design and Airflow. The thermal design for tlie D-class 
server also had some interesting challenges. The design 
strateg>' liad to encompass muHiple configurations and 
nuiltiple processor cliips and boaicLs. Some of diese options 
were in developnient, but most were future plans. A thermal 
aiialysis picigrani^ Flotherm, was used to develop die tliemial 
solution for tlie system. 

The Flothemi models aaid tests resulted in the package being 
seijarated into two main compartnients. The top half, which 
includes the t/0 and the processors (see Fig^ 3), is cooled by 
a 12-nmi tubeaxial hm. The processor chips are located side- 
by-side direct ly beliind the front fan^ giving an approximate 
air velocity to the processors of about 2.5 nVs. Heat sudcs 
are used for processor chips that consimie under 25 watts. 
For chips over 25 watts, a fan mounted in a spiral heat sink 
is iLsed. 

The bottom half of the package includes the peripheral bay 
and power supply and components. It Is cooietl using a 
12()-nuu lubeaxiai fan. However, when the hoi'.swai) disks 
are in use, a separate cot)ling system is installed. The hot- 
swai> bay is a sealed subsystem that uses a small l>lower to 
1)11 1 1 air through tiie disks. Ar\v disk can be pulled out and 
I he airflow to the otiit^r tlisks remaiiis relatively michauged. 
The power supply has its own 92-mm fan. 

Acoustics. The acoustical goal for the DH?lass ser\^er was 
designed to be 5.4 bels at the low end, wtiicrh was die smne 
as for earlier sender products. Ttiis package has a higher 
power density than previous products, more versatility, 
higher-speed discs, mid an off-tiie-shelf power supply raUter 
than a custom one. The fan in tlie |30wtT supply ended 
up being the loudest component of the system. Still, the 
system came in at 5.4 bels at the low end by custom tuning 
sheet-metal parts, baffles, and fan speeds. 
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1/0 Frame 




Gasket Over l/Q Frame 



EISA Bulkliead 



Fig. 8. Thp EIf5A bulkhead gaskets 
for tJiC! iJ-clasR snrvf^r. 



High Availability and Ease of Use 

Hot- Plug Internal Disks. .\n imijortant feature thai the D-rlass 
sender has brought to the Series 800 product line is liot- 
pluggable internal disk drives. UTiile commodity senders had 
once pro\1ded fomis of inteiTiaJ hot-phiggable disks, these 
solutions were deemed too hkely to cause data corruption 
for use in Series 800 systems. For example, the commodity 
hot-plug sohitions evaluated for use in Series 800 systems 
had the issue of "windows of vannerabilily" in wliit h sliding 
contacts from a sw^apping disk on a SC SI l>us could cause 
data bits to actually change after parity had already been 
driven, causing undetected data errors. WitJi a low i^robabil- 
ily of comiption, this approach may have madr sense for a 
workgroup file or print scr\^er. However, Series 800 servers 
are expected to operate in mission-critical en\iromnents. 



Thin Gauge 
Stainless Steel 




Copper- Nickel 
Mesh Over Feam 



^^ 



Paint Contacts 




3fiO- Degree 
Contact 



Old Design 

PQOf EMIand 
ESQ Resulls 



New Design 

Excellent EMI and 
ESD Results 



Fig* 9* A comijarison b*?tw^eeri the new and old core I/O gasket 
designs. The new design provides a 360-c!egree contact around 
each connector. 



Therefore, a more robust approach to hot-s wrapping disk 
diives had to be developed for these products. 

The approach used for the D-class server was to pro\ade a 
hot-sw^ap solution usmg logical volume management (LVM)* 
and mirroring middlew^me facihties^ iuid to offer a disk-drive 
carrier conmion to the standalone enclosure disk carrier 
being separately developed for the rest of tlie Series 800 
product line. The connuon ciu'rier approach would allow 
the field to learn only one solution and guarantee a higher 
volume of pads. In addition, solutions for tiie data corrup- 
tion problem, the use of sequenced SC SI resets, and an auto- 
mated sw^ap script could be shared by both the enclosure 
team and the D-chiss server team. 

Mission-Critical ServiceGuard and EISA I/O. HPs MC/Sei-vice- 
Guard product (portions of w^hich were previously called 
Switchover/UX) has been an unofficial standartl in the 
application-ser\^er industr>' for sevei al years now, Tbis 
uiidtileware product allows Series 800 systems to operate as 
standby servers for one ii^tother, transferring niissi on-critical 
w^orkloads from a failed system tc) its standby. This featiure 
requires that a system and its standby host transactions on 
the same mass storage buses, enabling the standby system 
to have access to ah of tiie primaty system *s data. Multiple 
hosting (knowTi kis multi-iiiitiator) on tnass-storage intercon- 
nects requires significant design attention, hi addition, the 
LANs that allow these systems to commimicate witli one 
another must offer special capabilities in the areas of error 
handling and reporting. 

A significant challenge for tlie D-ciass design was merging 
the w^orkstation, PC, and 1/0 infrastnictures w ith the Series 
BOO infrasintctm-e, Tliis requires supporiing MC/Servlce- 
Gnajcl capabilities, higher slot counts, more extensive error- 
baiidhng, and a remote support inlVastnicture. Before the 
D-class server, all Series 800 systems that supported MC/ 
ServiceGuard were HP Precision Bus (HP-PB) based, 

' IVM is sofrwafe that a I tows the number of file systems to be rBlatfveJy independent of the 
niimber al physrtal devices. One file system tan bespread over many dsvrces. of one dsH/tce 
can haye many fite systsn^s. 
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The rU'la^^ servers use of EISA aiid HP-HSC t with the EISA 
form fad or) refiuired the leant to iiiipleiiitmi, <It*byg and 
verify MC/S€»r\1rc*Gimrd functionaliiy on These new VO 
buses. In the process, various I'O tmple mentations had 
to be modjOecl, suc*h as sjierial EISA bus openiting modes. 
SCSI adapter fimctionalit>; periodic EISA cleanups, gtianin- 
reed arhitmtlon for cf>re LAN. queueing inuisiu-tiorts on HP- 
llSi\ extra bufferiuj^ of daia signals, iuul slot ttiririguralioiis. 
These modifications matle the IIP-H8C and El^^A l/t> infra- 
iitnuiure more robitsl in a highly availabie depanniental 
branch server. 

In addition, IIPs Access Port product, which is used for 
remote support of IIP servers, only existed on the HP-PB. 
Without HP-PB slots, the D-class sei%er would ha%e bad to 
either forego remote sup|>ort or develop* a new ciirci Tlie 
answer was uol to develop a new cartl ImiI to levenige exisi- 
iug logic by sui>ijlying a burit^l HP-PB on the new Access 
I\)il card. To the user, the I)-chtss serv-er's Access Von is 
km HP-HS(' card. However, lo the operatiug system and 
response center engineer, rlie fJ-t lass server's Access Poi1 
looks like the familicU, compatible HFM^B c^^u'd foiuid oti all 
the other senet^ iiroduets. 

PushbtittOfi Graceful Power Shutdown 

The I>class server is the first. Series 8(K) system to offer push- 
button grat'cful power sluitrlowrt Btisitally- when a D~ciass 
systeiu is up and ruDtiing. the p<jwei" hutiou is equivalent to 
the couuuaiui reboot h, wliit h causes tlie system to s>mchro- 
luze its hulTer eat he* aod giace folly shut dowii. Tiiis feature 
is most useful in a branch offic-e or department where the 
server is miniuuilly luana^t^eil liy Inriil pei-snuticl. Single-user 
HP worksiaf i(His had introduced lliis fccUure lo TA-UISC 
systems. 

Built-in Remote Management 

VVitli an emf)hasis on leuu^re seiver managejn(*ut, th(^ D- 
class st^rver It^ain fi*'cidtHi to offer the same rohtist, world- 
wide-usai>le internal modem as the K-series products* This 
uiofii'Ui offers siit>t)ni1 for transfer rates well beyond those 
used by i(><lay s HP lespuust* centers and is iiUegriiled vviih 
the J'emtjie sn|>porl assembly in D-class sysleius, Thi^ jutxl- 
u<^t also offers a special serial ptul for ct>n1 rolling optional 
utiiuterruptible i>ower supplies (UPS), as well bs pinoiU del V 
nilitins for fulure diren eoiUroI t>f inlcrnal power-supply 
signals. 

Th(^ l)-<*lass server team also aceomutodated "cousoleless*" 
systems. wliere!jy a D-ehiss ser\er c^ui he ctjni|jk4ely man- 
agetl remotely without a tof^al cottsole at all. lu addition. 



graphic's console customers can still take arhantage of 
remote console mirroring (fonuerly reseneit strictly for 
RS-232 eousoli^j by merely Otpping a switeh on the product. 

Conclosloti 

The system pariitlcming design for the first release of the 

D-cIass seners beli>e*l u> achie\e all of our introduction 
goals. We were aliie lo iiuriMhu^e both PA TRNJLt -based and 
PA 72(Xl-biLse(J poM'essor uiodnles. Integi-ate the industiy 
standard EISA UO bus ini.o the Serit^ 8CK» himlware for the 
first lime, and achieve our ct*st and schedule goals witluHit 
any investment iti new I'LSI ASICs. 

hi the end. the D-class design had leveragefl from all current 
entiy''le\'el and midrange Series 8<M) servers and numy Series 
700 workslations- Because of the care taken during the 
adaptation i>rocess, perfonnance enhant-enients made lo 
the original dc^sigu soyr<*es were made a%'ajlable in the latest 
D-class module qiikkly ;md with ver>^ Utile invest numt. As 
an exant|.>k% 4Hily two weeks after a liu'ger-cache, higher- 
speed PA 7200 K-series processor nwdule w^as released* the 
coiresjionding D-class PA 7200 processor module had been 
nujdtfied and tpleastnl foE' i}it)totyi)e. Ttiis rnotiule provides 
fotir times the caelu^ and a 20^o incre^ise ii^ hetiueitcy over 
tlie initiiil D-class PA lliH) module. 

Table 1\* suiiiraarizes the leverage sources for the various 
subsysteitts that make up the I>€liiss ser\'ers. 



Leverage Sources 
D-Class Subsvstem 

EISA l/i ) 
IIP^lISC*I/0 

docking 

I*A 710f)LC I^ocessor 
Modules 

PA7j!00and 1 'A 8000 
Phk essor Moduli^s 

Pi\ eJOOLC Processor 
Modules 

Power Su])itly 

Hot Swa}> Disk .^rr iiy 



Table IV 
far the DC lass Subsystems 

Leverage Source 

HP 9000 Series 700 Wtjrksiaiioris 

K'Class sener and J-class and 
HP mm) Series 700 Workstations 

E-dass antl K-c4ass Seivers 

En^lass Serv-er 

K-class Ser\'er 

C-class Server 

hid ustrij^^-Stimd ar d Su j ) j )h e i^ 
HPs Disk Memory Division 



)Copr. 1949-1998 Hewlett-Packard Co. 



him mil \h'w\i^t\Piivkm\ .Irnirnnf 81 



A Low-Cost Workstation with 
Enhanced Performance and I/O 
Capabilities 

Various entities involved in product development came together at 
different times to solve a design problem, evaluate costs, and make 
adjustments to their own projects to accommodate tiie cost and 
performance goals of the low-cost HP 9000 B-class workstation. 

by Scott R Allan, Bnice R Bergman n, Ronald R Dean, Dianne Jiang, and Dennis L. Floyd 



The design and development of the HP 9000 B<^lass wf >t'k- 
stalion is a good example of cooperative engineering. 
In cooperative engine eriiig, the various entities involved in 
product development come together at different times to 
solve a problem or nmke atijuslmenls to their o\\ti projects 
to acconunodate a comnu>n [leed, Kxamples of I his co- 
operation for The B-chiss workslation inrhide ronrdinaiion 
between system designers anti firniware deveh>|jers, the 
addition of new functicMiality without impacting ttie fJevelop- 
ment schedulPj close ties with maj^ufaciunng, evahiET.ti(jn of 
imjjlementation based on detailed cost models, anti simplifi- 
cation of the PA 7300LC' design by moving clocking fonctions 
onto a small chip on the system hoard. 

Design Objectives 

The design objectiv<:^s Ibr ihe B-class workstatitxn were low 
cost, quick time to market, performance, functionality, lon- 
ge\ity, ajid modularity. In additi<:in to these objectives, the 
deveio]jmen( teants maui goal was to yiroduce a workstation 
based on the PA 7300LC' processor that would be compar- 
ably priced to the HP 9000 Model 715 workstation, but witii 
suijerioi' i>erformance and I/O eapabilit ies. Tiiis goal and the 
design objectives remained the same throughout ihe project. 

With low cost Jis the primaiy object iv^e, any feature that was 
perceived as too costly or of limited value to oiu' customer 
base was not included. U^veraged subsystems were reviewed 
in search of creative ways to retiuce trost. This led to reduc- 
tions in tlie cost of the clock circuitry and firmware inter- 
face iuid elimination of some legacy I/O ihterfaces. From a 
cost/performance pers[>eciive we were al)le lo justify the 
addition of a PC1 (Peripheral Component Interconnect) bns, 
a higher-speed memoiy technolog\^ a second-level cache, 
and a lugher-performajite processor and grapliics subsys- 
tem. Fig. 1 shows the B-class workstation. 

Features and Capabilities 

Based on the objectives for the B -class workstation, the 
following featiues are included in the product: 

• PA 7^300LC high-perfomiimce. low-cost microprocessor witji 
two on-cliip iissociative caches with iK bytes for data and 
64 K b>tes for instructions 

• IM b>1es of ECC (enor-coirecting codel directly mapped 
second-level cache for additional performance 



. HP VISUAIJZE grai)hics teclmology from IIP VLSUAIJZE-EG 
(en I ly-levei graphics) 

• HP VISLIALIZE-IVX graphics on the BLJ2 workstation 
(optional) 

• Six memor>^ slots that supi:)ort up to 758 M bytes of ECC 
jiiemoi> inrhidmg fast'])age mode (FPM) and extended- 
data-out (EDO) DRAM dual inline memory nioduJes 
(DlMMs) 

• General system comiect (GSC ) bus for lugh-speed I/O 
l)aiidwidth 

• Flexible VO that includes two I/O slots, which can be 
conflgm'cd as; 

- TV^o PCI slots 

:: Two GSC slots 

( )ne EISA slot 

• Optional fast-wide SCSI (20-Mbyte/s) carci that supports 
internal mid external disks without using an VO slot. 




Fig. 1* The B-cbss \vurk:st4itit>n. 
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Fig. S. 11 u' system bioek diagram for the B-class worJcsftatiori. 



)Copr. 1949-1998 Hewlett-Packard Co. 



Jiiri I \^Ti \ lew] f tt ■ Pat'karii ■) < iiiniu I S3 



In addition lo these features, the B-<lass workstation's rnod- 
nlai' tie-sign |>rovides sinipli^ inHtallation, nexihiltty of iLse, 
and easy semciiig. Tliis is act-oniplished througl^ design 
feaLurt>s such as: 

' Simpk' tiay design 

■ Biiilt-iii pxj»aji{lal)jlily 

> Pi iig-i n 1 1 let i it >ry 1 1 1< n I u J es . 

Fig, 2 shows a bloc^k diagram of the components l hat niake 
ni.) the B-class wfukstation. 

Processor and System Design 

Sinr<* the processor chip used in tlie B-ciass profkicts is 
ilie PA 7'^0i\LC. one of I lie maiji areas of cooperation was 
between tlie PA 7300L(' processor design team and the 
B-class system design team. 

The previous-gen eratior^ prfjcessor ast^d in IIP workstations 
of a eomparabk^ price was the PA 7100 L(\ The PA 7100LC 
was an extremely versatile processoi'. and majiy of its 
f>est points were leveraged into the PA 7;]tJ0LC design 
(see the articles on pages 43, 48, (il, and 69). However, 
the PA 7ltJ0LC' was not without its challenges, such as 
the ditli! ulty in synchronizing the processor clock with 
the GSC (general system connect) bus. 

Cluck Frequency 

Tlie (jSC bus is a general-f impose synchronous l>us used to 
t'Onmumicate between tlie processor mid I/O. Its phase Ls 
deterrnined in relation to a nonexistent GSC clock. This 
imaginary clock nms at half the frequency of the t lock sync 
signals tirivcji to each GSC de\ice. Its rising edge is defiiu^d 
by file rising edge of reset during initialization, ^md each 
(iSC <levi<'c IS resijuusiljle foj keeping track of the cm rent 
phase of liic GSC clock starling from initialization. 

On the PA 7I()0LC, the GSC bus was only pennitted to oper- 
ate at Hxed ratios of the processoi' clock fre<iueucy, including 
some od(J clock raiios such as i.5il (see Fig. 3). All of the 
clock syru s and the resets used to initialize the GSC clock 
were cxlermd \u ihc riii|i. Designing circuitry to niaiuiain 
these ratios and timing maigins vvitli minimjil clock skew and 
noise immmiity became increasingly ]:iroblematic. In addition, 
every rre<|uency point of operai ion required a si>ecial clock 
design to ensure nuiximum i>erformmic(^ This limited our 
al>ilil y to select the IVe<|iuMiey of operation l>ased upon yield 
at a later point in the design process. For tlie PA 7300LC, the 
situation became moti^ eiitical I because the final processor 
frequency wa.s still unceilaiu, and the final ratio between the 
processor fictjtii^iu y and the GSC clock was also undecided. 

live t'li'st ap|>r'oacii investigated was lo lulng the entire flock- 
ing scjluttcjii iiuu I he PA 7'MMA\ \t would lie much easier to 
actjust the delays and control the skew within an ASIC rather 
than in discrete t ircnits. Tlie proposal wa.s to incorj^orate a 



PCLK = Pfocessof Clock 

G&DSYNC = lieneral System Coniteci Sync Signal 

Fig, 3, The PA TKXHr f lotkiiig scheiae. 



|>hase-lockerl loo[j <rirruil wit 1 1 in itie t*A 7;^0ULC to generate 
the processor clocks from a iow-frtniuency external crystal. 

The GSC syncs could then be created by dividing I he 
phase-locked looj) output internally bi the PA ISOOhC. The 
PA 7;300LC would also drive out the reset used to initialize 
the GHC phase. Cpon furtiier investigation, the PA 7ri(J{)tX- 
design team became concemtxi about the risk associated 
with tlie ptiase- locked ioojr The phase-locked Irnjp was con- 
sidered a major component of the PA 7;300LC design. This 
was sigjiilicant because all post-fabrication verification and 
debugging uf the chip would be depc^ndenl upon a functional 
phase-locked loop. 

At tills [joint, tlie B^class system designers jmd I be PA 7-3(X)LC 
design team began to luok at a niixeil sohition. The phase- 
locked loop was scrapped to avoid risk, ami its die area 
recovered for other uses. The PA 7'30t)LC woutd cmrtinue to 
drive the priinaiy syjK'hronizing reset to eliminate ilie need 
lo synehrouize the asynchionous poweron reset to tlie GSC's 
syncs. The generation of the syiics and tiie maintemuice of 
tlicir skew requirements would be moved to mi external 
ASIC. Any uecessaiy turns to a small ASIC wouhl l)e quicker 
and less expensive. In addition, the clocking solutiejn could 
be completely bypassed to allow continued verification ant I 
del nigging of the PA 7^it)(>Lt: it iiecessaix 

Working with Mf)torola, the PA 7300LC design temn, ^md our 
materials organization, the system designers specified tlie 
device^ that became the Moloiolti MPC992 ( tlu^ clock genera- 
tor in Fig. 2). Tliis ilevice uses a phase-locked loop and an 
external low-frequency crystal to generate dilTerential clocks 
that provide clocking to the processor and the other GSC 
de\'ices. As an added benefit ^ it^s cost is relatively low in 
relation lo the external ckjt k oscillatrM and RClj devices 
used in pre vioiLspjud nets. The USVNC signal, which ccjines 
from the PA 7300LC i>rocessor, is the synchronizing signal 
that is responsible for aligning the GSCSYNC with the proces- 
stjr clock signal. 

Memory and I/O Controller 

The proximity and working relationship between the 
PA 7'M)0L(^ and B-class system design teams allowed iis 
to communicate design specifications with relative ease. 
This winking environment allowed us to view the product 
as a whole rather than designing the system around an 
existing chi J). 

The design f>f tlie memory and I/O controller (MIOC) was 
the fn^st iu^a affected by this arrmigenient. The PA 7300LC is 
designed to support optlimtil second -level caches of different 
technologies and sizes. \Mien the PA 7;i0f)LC chip design 
team began investigating each of these second-level cache 
options, the B-elass system designers were able to check 
ihe api>i oj>riateness of their solution with the design. One of 
the first det isium> ujuler this arrangement was to make the 
second-level cacrhe fjptiouiil and locate it on a DIMM (dual 
uiline memoi^' module) on a sepmate boaitt Tliis t>rovidi^s 
Ibe B-class workstation witli seveial benefiLs: 
Lower-performanci^ systems me not burdened with the cost 
of the second-le\'el cache. 

Systems with jmd without a second-level cache can share 
system Ijoards, reducing development and \'eri fication time. 



84 



.Iiiiu' Uii^T Hcwiptl-P^irkuird JntiniiiJ 



)Copr. 1949-1998 Hewlett-Packard Co. 



* The exact cortfiguration of ihe second-level cache can be 
altered at a later date if market conditions warranted. 

► Less space is required on Ihe board, pennilling a lower-cost 
s>i>ieiir Ixjard, 

It vtBs importaiii for the PA 73i.iHJLC' design team to know 
tliiil a t*IMM solution was l>eing considered since it would 
Imve a big impact on ihe l/'t) pad design of the PA 73(M}L(\ 

.Another area cjf coucem within tlie MK)C' iinolvi^l ihe 
impact of the ex'j>ande<i data bus on the PA 7;i00lX' i 144 bitsj 
compared to the PA TKMJLC (72 bits). This would require 
additioniii pin.s and incur additional packaging coste. Tlie 
PA 7^^(K)LC design team Wiuued to shmi* tiie memory data 
i>us with Ihe second-level cache data bus to redu<*e the nunv 
her f>f external I/O pins. However, the additional load aivsoci- 
ated with the memoiy would degrade the response of llu^ 
secon<i-level cache. Tlu^ PA T^iOULC* design team suggested 
FET switches, which could be dynamically opened and 
closed to isolate the second-level cache from main memory. 

The B-class system designers were able to verify using 
FET switches in a system en%1ronment. However, the only 
devices available that met the enaljle/disable speed require- 
ments weie 8-hit de\ices. Tliis was \iewed as an iinvviekly 
and i^xpensive solution in the B-<4ass system. Wcnking with 
our materials orgmiization and Tex els Instnmients, the B-ciass 
system designt4>^ were able to nuike minor s|)eririt alion 
changes to au existing 24-bit Texas Instmmenis pai1 to iiu- 
prove thisHpeetl i>arameter and cut the qmuitity and fost of 
the P'KT switches signlllcamiy. The B-elass systeia flesigners 
verified the signal tjuality of the memor>' data and second- 
k^vel cache data of these devices in a system euvirotunent . 

As the conllguratJon of thesecotirf-lt^vel cache solididt^l tlu^ 
B-class syKleui dt^igners wert^ al>le to jUinide the P\ TiuiitX' 
desigfk team willi specific iiifoniiatinii coiuu'niiiig the eh'tiri- 
Cril etivtronment in which the PA 7o(M)li/ would be uperaling. 
Witli litis infomtation they were able to nin sinnilatitms of 
their lA) t>ad drivers o])eniliiig witliin itie actual systeiti. 
Tins led li^ some cEiaitge.s in Iheirt^ad designs, eluiiitiating 
potential problems later. 

Memory 

As with most jinjjt'cis, the PA 7'!iMJL</ design tenm and the 
B-class system designei's had tlieir share of resolute siifjil- 
ages. One sucf^ issue ittvolved the metnorj' family. The 
PA 7'}i)i)LV isd(*signed to suiit)on b(»th fast page nuKieaiul 
extendefi-datH-<Mit URAMs. hi fast-page modi , seqiu'iitiHl 
data is driven from the DF^AMs <m successive col lunn ad- 
dresses bilUming a single row address, rather than retinij iug 
t)oth the row ^ukI coiutnn aiidress to be driveti on eacli data 
access, Extemk^d-data-out iJlLAMsareau eiihancenieni lo 
fast-|>age mo<k^ l)t-LAMs iu vvhirti liie data reiuaiiis vjilid uulij 
the coluinn addres-s chiingt^s or a new column adtiress stroln' 
oecuT's, ryiher than hecojning invalid when liie 4'ohnnn 
Hddres.s stn>be disaptn^*u-s. This allows a longer titne period 
over ^vtiirli In latch incoming data and saves processor 
states in mejnnry accesses. 

I 'nforttmately* resoiu'ce conflicts and siliedtile constraints 
made it imt>ossil*k^ for tlie PA 7;iO()LC* dissign leatn in verify 
nmcti<mahty (jItIk' clii|i for btMh itieiuoi^ technoltjgies. 



The PA T300LC design leant wanted to qualify the exteniied- 
data-out DRAM teclmologj' l>ecaus«* it would prD\1ile a higher- 
perfonnance memory technolog>. Tlie B-class system design 
I earn wanted the fast't*age mode URAAJ technology' qiKdified 
lo be compatible across tlie workstation family, rather than 
ha\ing a iinkiue memory component for the B-class syslenis. 
Tlie c«>nipromLse solution was to have tlie PA 7^JtjtJLC design 
team qualify tlie fast-page mode DRVM leclujolog^^ for first 
relea.se. At a later point in the design phase, the B< lass 
system designers would qualify the operation of extendetl- 
data<iuf mode DRAMs to be introduced as a perfomtajice 
enhancement. 

Data Capture 

Resource balancing was also evident in die development 
of a data capture l>oai'd f<*r Uie PA 73(M)LC. A data capture 
board is a de\ice that is attacheil to a system board and is 
used to obser\'e tlie high-lrequency signals between the 
processor, second-level cache, ami niemory for debugging 
puq>oses. Siiit e the B-c!ass syslt^rn designers wei'c mtire 
familiar with board design tools and the hoarti <k'sign etivi- 
ronmenl. the B-t^lass system dt^sign team devek^ijed the data 
cajiture boaid for dehugglivg thi^ VA 7'M)LC, 

Hardware and Firmware Trade-ofTs 

Design teams frequently look at iratle-offs between optimiz- 
ing resources and meeting the goals of the team. For the 
B-ckLss workstation, the hardware and firmware I earns 
fostered a ch)se working relationship, allowing tradtM^ffs to 
lie tnatle on a broatler scale, 

A n\ost unusual but significant nut come of I his < lose w cork- 
ing relationshiti wiis the (UnvkitHivent uf an unjilanned ASK ' 
for interfacing to the processor deptmdent hardware (PDH). 
The PDH consists of coniponenis such as the boot ROM 
ntjrivohdile memory, and t oivfiguralion registers. Alihougli 
there was already a way lo conntu t b) the 1*1)11 l'nn<linnality 
ihiuugii pan <if thi^ core I/O logic being leveraged Iroju tui-- 
vious lower-end workstations, this interface did not provide 
the levt^l of fun( lionality that was implemented in the higher- 
end workstations. The jlrmware team conki s^ive signilicant 
resoutces by hn en aging porlh>nsi>f code frcnn the t'-cktss 
woj kslalion mid ihe higlierdul niembei^ ol the D-class 
sender faintly Many of the basic I/O and grai jhics funtiions 
wvrv similar be I ween lb esc |)lajf<jrm.s, Ihnvever, Ihe cocU'^ 
leverage was predicated (n\ liavjiig certain PDH functionality 
tlial could not be providi*d with tla^ low -end solution. In 
addition, the high-etid solution [irovided suiJeritjr ilebug ta- 
pahilities. These i>etter tk'bng catmbililies were veiy attrac- 
t i v e to h e 1 1 1 e n s u re a sj >et* t ly si a rt 1 1 p o f 1 1 le n e w P.'\ 7^t( K )L( ' 
pr<M'essor. and hence luvlii nieet our tinie-lo-niarket goals. 

The key cat>ability missing from Ihe PDH inteiface used in 
previous low(*r-end workstations was the al>ility to t>erforjn 
word- wide writt* accessc^s to PDM d(»vices. The PDH inter- 
face was (optimize* 1 for reads, wit It only hyle-wiite t at abili- 
ties |Movi<k^d. TIh^ new PDH .\SK' athk^I Ihe wttrd-write 
capalMlily to su]>t"*^il a scnut h RAM rhisstHniingJy inno 
cent scratch RAM wiis key, Ijccause In higli-end workstation 
code it is user! as a slack in tht* early stagi-s of tln^ tioot jiro- 
cess before main mernoiy is niilialized. Tlie scratch RAM ts 
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also used for global uifomiatioii such as tables of 1/0 and 
^raphit's confi^iiraTioi^ inforniation. It would h-dvc been \Tiy 
(JilTicuit to levera|?e code with ihe word-vv^iLt> capability to a 
piatfonu without this capability. 

The new PDH ASIC" also provided additional address ticcode 
and the appropriate llexibiiiLy in timing to allow the tliiect 
connection of a serial port into the PDH hardw^are. This 
direct (^onnccliou \o a serial poit. in coivjimction with the 
t at>ai>ilities tjlTered with the scratch RAM, allowed a debug- 
ger to be oijcrational even widi hanlware (hat wlls nutiimaily 
fniK lioiiaL This serial pon aided code and hmxiware del>ug- 
giiig by allowing tiie hardwane status to be niouitor'ed and 
the hardware configuration to be modified early in the l»oot 
process. 

The risk foj I he new PDH ASIC w^is niioimized l>y incori>t>- 
rating il intt) system Hsimidation cffoits and Ijy kee^fing (he 
design focused ou the needetl fmictiunahty mid disallowing 
any uiuiecessaiy featiures. 

Product Definition 

The B-Clttss system was originally defined alongside the 
C-class workstations. Tlie B -class system is essentially a 
smaller version of the OcJass workstation. Our original 
mtentioji for the B-t lass hnj>leiiienta(ion was to use the 
same modular ])hJlos<jpliy orsej>arale l/(h (Pl\ disk inter- 
face, and human interface subsystems usefi in the ("class 
machines. However, when the time came to implement the 
B -class jjrodurt, cost goals had become more ini]:toriaiit. 
^\lien preliminary' costs were evaluated, it became cleai" that 
we w ere not meetmg the cost objec rives W'ith the existuig 
pi'0( 1 lid { lefniit ion , 

Many alternatives were generated and evahiated againsT 
product objectives. Finance and Ri^D reviewed Iheir cost 
models to see where costs c^ould be saved. Maiiufactming 
re\iewed the design alternatives for mamifactioaljihty mid 
anidyzed the supi>ly chain for issues asst)cial(*d with paits 
procurement, assembly, material, and structure. Service was 
considted to review serviceability antl warnmty implications 
of the various options* as well as issues with poleiitial future 
upgiade products. The result of diis analysis was a single- 
board integrated computer' (see Fig. 4). The tiesign. which 



wns initially spread out over four separate boards in the 
(_ -class system for the sake of modularity, was now inte- 
grated onto one system board. 

Single -TVay Concept 

Like the C-chiss workstation^ tJie B-class workstation uses 
a tray concept. How even instead of two trays (one for the 
disks and one for the boards}, there is one tray that holds 
ever>1hing (see Pig. 5). For this reason, during the design 
phase it was important to consider keeping the weight 
down. Holes were adde<i in the tiay wherever possible to 
reduce I he overall product weight. 

EMI 

The tray assemibiy slides into a metal can. With tliis appraactu 
the EMI (electionmgnetic interference) interface is limited 
to the perimeter of the reai^ |)aneh Once the tray is removed, 
there is easy acf^ess to the option bom'cLs, memory modules, 
second-level cache modules, optictnal fast -wide SCSI interface 
board, power snpply, disk drives, spc^aker, fans, and the CPU 
chip. Ti\e system board is accessible by remo\irTg liie disk 
bay, which is sectored l>y only one screw^ and a few cabies. 

Disk drives vim be accessed without remf.jving I lie disk bay 
fiom the main tray simply Ijy removing the snap-on cover. 
The disks are mounted using plastit^ brackets so tliat they 
can be changed wit hoi i! tools (Fig. t>). A f'<m was added to 
tlie botton) uf the disk buy to provide enhanced tiisk cooling 
since successive generations of tUsks consume n^ore power. 
Removing the barkplane is sliglnly more diOlcu!!, reutriring 
all rno<fules to be rerncnetl first. 

Man uf ac turing 

Working with manufactiuing included performing a supply 
ihain analysis^ £is pai1 of the total system cost analysis, 
Desigr^ effons inoduced detailed mateiial lists that were 
used to determine an overall system ctjst. Several design 
scenarios w^ere de\'elo]7ed witli niechanically exploited 
models and n^aterial lists. Tlie cost model for the B-class 
workstation was not limited Just to the material content of 
llie product, l)iit also included system intercomiect costs, 
]jarts procurement costs, part placement costs, prmted cir- 
cuit tioard electrical and system fimctional testing costs, and 
system support costs. 
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Power Siipp*if PCI. EISA 
and GSC Baanls 




Fi^. 5, TIh' mum imy ns^sembly. 

Mitrui farm ring and neld-^support representatives were in- 
volved in deFinirtg l he system for niaxiurantirability. Inven- 
tory control and confignrabiiiiy to reduce ihe sysiem cost. 
Design Jicenarios were then evaluated against eaeli design 
otjjettive. 

Initial prototypes wen?* assembled and disassembled by 
manufacturiiig pei^onnel to provide a liands-on eritique of 
I he designs. These inputs were fed bark into the design in 
I he early stages of de\ elopment. 

Serviceability 

One of rhe challenges of this single-board solution was lo 
make the system board accessit)le for seniee. We wanled lo 
liave the board slitte in anci out. InU lliere were eonneetors 
and svviiehes r)n liolli edges (jf Ihe boanl In afldition. the 
ctmnvvUirs had to be accessitjle through llie rear panel 
To allow the board to slide into the par-kage we added a 
smidl Iniv tf> tlie iHitttini of the system hoanl that ronld sh<Je 
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along card guides. One of the design requests from seniee 

represent am es was lo be able to service the s>'sf em iwaixi 
withoni removing the rear iJ«inel To aceompUsh this, the 
rear-panel (*onnei*u>rK were reressetl and a separate small 
bulkhead attached \^ith a slidiiig EMI interface to the rear 
panel This bulkhead remairus attached to the bo^ird in a 
boanl replacements 

AntJther s*^r\iceabilily concent was the alignment of the 
power button, mute button, volume knob, audio jacks, and 
LEDs located on the front of the product. In the chassis, the 
main tray engages aJigimrent puLs, wluch sen. e to lock the 
imy io the chassis during \ibration aiKJ sliot^k Bec^ause of 
the lolenmce stackup from the front piUiel through the ciin, 
tray, back|>laiie, system IwiU'cl imd all ihe connectors imd 
buttons, we were concerned thai die cosmetics at the front 
w^ould t>e unacceptable. To improve alignment, we mounteHl 
these connect 01^ on a Umg. riuEi section of t lie printed cir- 
cuit board flint would Hex and besut>ported by a metiit 
brace so that the front section could rno\ e relative to the 
rest of the board. We added aligi^tng forks to the front panel 
to positimi just that sicction of boaiTl With fliis method, we 
were able lo locate these connectors accurately. 

Mantifacluring also assisted in improving the design through 
their i)ariHi|>ation in design reviews. One suggestion lead us 
ti> abaniJtjn the captive rem'-panel fasleners that we had 
been plarming to use. If the captive screws ai'e not properly 
aligned, they can be c-rcjss- threaded and strip fieri, <ir the 
captive nut on the chassis may be damaged. Consequently; 
tlie whole tray would need to be replaced just for a simple 
nut or screw. Instead, w^c designed cnsf oni thumbscrew^s 
with tm unthre^aded luise section to align the screw before 
the tlueads engage. This minitnizes <ross-th reading. T*t save 
labor costs we alsf* used a coai^e thread to reflu<'(^ the uum- 
bei- of Imitations necesjsiiry to remove and lasiall tluin- 

Allot her goal was io reduce the immber of screw tyiies. W'e 
tried la standardise on a single screw used in our earlier 
opii**n boards because this was the one screw that we ccnild 
nr>l irbangc. We used it to atta^ii the |>ower sntiply ant! tltc 
disk bay* To reduce screw count, the niaiti fiuis and sj^eaker 
snap in place. The backplane slides in place with keyhole 
standoffs and forks in the main tray: When the |)ower supply 
is ir^stailed two [nns from the f>uwer snt>ply tnip \hv back- 
t)lane in place. The jHUver snp|>ly is sujiptmed vvirli two 
screws atvd the two pins thai are routed ihnnigh the hack- 
plane into the backi^lane suppori. The powtT supply has 
floating connectors so t!iat stresst'S frr jin vibration and slioc k 
aie not tninsmitted betwet^n iht^ backplane and ihe [icAver 
supply via the connectors. 

(hie of live pri inar>' f>i)jpctives was Uf>gradai)ihiy. r[>gra<les 
can be easUy accornj>lish(^d l>y a simple^ swap of tiie system 
board. Since everything is on one board, there are no issues 
wiih incmuijalibililies between d if ft Tent versions of the t/O 
and VPV. The small l/( > bulkhead slays with the boaixl so the 
main tray assembly need not change. Sufficient extra height 
remains where* tht^ CPL' and memtny arc* local i»d so that 
future high-power CPUs have* room IVn^ larger heat sink^ 
or 4*ven small daughter boai'ds if more boai'ci real ^-siate is 
needed. 
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Processor and System Verification 

The verification eflbrt for the PA 7300L(^ md t\w H-cUss ajul 
C-rlass i)rr)diK;Ls was alstj ajdint (^ffoit. Shnioo tests were 
condurled siniultaiieously on both tlie B-c lass and t.'<'ia.ss 
work slat i«>ns. 

A shmoo test is designed to verify the produet undei^ v ull- 
age, teoif>eratnre, ;md frequeney extremes. Its inienlinn is lo 
elerui rally stress the system under test to within antl be- 
yond lis ejieratin^ limits. Tliif^ praeess is part of our elect ri- 
ciil chaiarlerizatir)!! of the jHocessr^r anci system. A shmoo 
test is ail iiniK)i1ant part of ourjirodnet development eyele. 
By pushing the system to its eleetr ital exln-rnes, we hope to 
reveal iiny deslgfi we£ikn€\sses tiiat et>nld affect the r:)peratlon 
mid perforniaiice of the system under extreme operating 
conditions. It often uncovers weaknesses in both chi}) and 
board flesigns. These nii^ht inelude signal cross talk, chip- 
drive capal)iliiy, slow-speed [>aths at high temperatnies, ni 
boJ:ii'd-level clocking proljknns. 

To achieve superior product quality, both processor antl 
system shmoo tests were performed on B-cla"iss sy steins, 'f he 
|)rf jcessor slmioo test focused on the core [processor, caches, 
inriuoty, and (iSC [jus. The system sbiiiuo test emphasized 
IHTiidierals mul VO, inchiding the escpansion I/t> on the GSC. 
KISA, and I\;il) uses. 

Since the PA 7.30OLC' was designed to work in both B-class 
and C~class systems, it w^as tested in both systems. Proces- 
sor characteri;5arion was performed in the C;-class systems 
by (he PA 7']00l.t" design tetan. Simnllanetnisty, th(^ B-class 
system design team completed the jiroeessuj shmor)s in a 
B-cliLss system. Both the B-class and C'-class system design 
teams completed system shnioos with the PA 7;J()0LC in 
their respective environments. 

The pai'allel verifications of the PA 7;¥)0LC in the* B-class 
and C-class systems coniplemeiited each olhcri providing 
(jpijoil unities for leveraging and maki^ng the debug iinuess 
go smoother (Jne of the issues discovered during the pro- 
cessor shmoo test was tlie limited operating frequency of 
the GS^C tuis. This was caused hy the length and load on the 
bnsm^d a llireshold problem on the PA 73()()LC, The eoiii- 
laned efforts of (lie* PA 7300LC j process or a [id tJ-cliLss system 
design teams extended the operating frequency of the GSt"^ 
Ijus in our systents aivd provided the flesired pcrformmice. 
The PA 130{)LC design team coneeted ihe (firc^shold problem 
and the B-class team shortened the GS(.' bus, which shghtly 
( JKinged its characteristic hiipedance aiid helped to alleviate 
the problem. 

Processor-level electrical verifieation has three main goals- 
uncover electrical ( nonfLuictional ) bugs in the system, fhid 
critical speed paths Hiat limit the maximum frequency of the 
processor, and provide eorrelation between Ihe IC tester 
heqiteney and the eventual system frequeney. The third goal 
had the biggest impact on costs. As de\'elopmeni prfjgiessed, 
it became obvious (o the PA 7:100L(" and B-class teams that 



(he fretjuency mix (132 MIIa \o !&) MHz) between die IC 
(ester and the system was no( meetmg niiirketing rtHjuire- 
meiils. The correlation effon betw^een the teams uni*overed 
ways to enJiance the sysiem elect rical an<l thermal cnvlrrm- 
nients to bring the yiekl mix atul mark**! dematni (Mgethen 
The close cooperation between the tw^o teams enabled 
the (tuick identification of a solution to the problenu We 
made alt eratioris 1(j the system s I hernial cooling environ- 
ment, allowing us lo run ihe PA 7:JbOLt' at a higlier fre- 
qutmcy, sometiuiig we conld Jiot do in the original cooUng 
enviromiienb 

(h^er tlie years, many effoi-ts have been made to address aiid 
imiirove the shmoo tc^st prcjcess at both the proc essor 
mid system levfd. While processor shmoo (esling reveals 
many systt^ni level jiroblems, its primary focus is still (he 
tirof essor, cachc\ and inenK>ry subsystems, rather than the 
I/O subsystems. As I/O bus speed and peri]jheral interface IC? 
<<)[iiplexity has inc^reased, it has become n\ore im|>oi1;mf to 
address the I/O sulisysiems in shmoo testing. The PA 7;>tirjLC 
was designed to make conqdete system shnioos more j»ra<:- 
( ic al foi IhLs reason. The clock circuitry for the PA 7:3O0IX' 
was designed tfj permit oveiTiding the nominid clock fre- 
(]uen( y while maintaining The corrcd synciiromms reiaiion- 
shi^i l>e(wecn tlu^ prticessor ant( I/O clocks. This allowed ns 
to vary Ihe frequency of operation more easily over a Imger 
range of operation than in past products- 

One of the challenges for system shmoo testing hi B-ciass 
systems w^as the range of new system components thaE had 
to function (orreetly together during tesring. As with the 
processor shmoo, system testing attempted to stress the 
electrical design of the new components by operating (iiem 
under extremes of tempera! n re, voltage, and frequency. In 
additi(jn (o tht^ core UO components, various exjiansion 1/0 
cards were selected to verify complete system fimcrionaUty 

The extensive system slunoo testing of the B-class systecu 
led to Ihe optimization <jf several circuits and resulted in a 
[ligtieepi^rfoniiing, more robiLst system. We have come to 
believe (tia( shmoo tests aie ;m indis[>ensablc part of our 
[irodnct developnicnt. Besides helping to calch i>otential 
problems belbn^ irtt rod uf lion, shmoo tests also make jiost- 
product support and maintenance easier* 

Conclusion 

C'ooperaiive efforts between numy functional areas such its 
nianut at luring, scnice and sni>port. marketing, firntware 
de\elo|aiient, and the PA 7::iOOL{^ clnp develoiDment team 
together with (be electrical and mechanical system designers 
have prodticed (fie B^iass w(jrkstalious. The closely coupled 
system design ai>proach has yiekied a workstati<m that pro- 
V itles signifieant value to our ciistomei's. 
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Testing Safety-Critical Software 

Testing safety-critical software differs from conventional testing in that 
the test design approach must consider the defined and implied safety of 
ttie software at a level as high as the functionality to be tested, and the 
test software has to be developed and validated using the same quality 
assurance processes as the software itself. 

by Evangelos Nikolaropoiilos 



Test technology^ is rrmial for successful prorlucl dcvelop- 
ment, Inappropnale or iatc tests, imderesli mated testing 
effort, or wTong le^t technology choices have often ItHi 
projecis to crisis and fnistratjon. This s(.>nwcire crisis ^c^sults 
from nt^glecting the imbalance 1 jet ween coiislriJtiiw sijftware 
enghieenng and anal vile quality assurance. In this culicit* we 
exijlain the testing coiirepLs, the testing terhnit[ues, aiid the 
test technology approach applied to the [jatient nionitoi"s of 
i!\e IIP OnmiCare family. 

Patient monitors are tVlecironir inedicid tk'viees forohsen- 
mg critically 11] t>atients by monitoring their physiological 
pjirametei-s (E(Xi, heat1 rate, blood pressure, restnrat.ory 
gases, oxygen saturation, and so on) in real time, A monitor 
can alert medical personnel when a ])hysioh>gjcal value 
exceeds preset limits and can report the [jatienfs status ou 
a variety of extc^mal clevici^s such as reconlers, t>t inters, and 
computers, or .simply semi the data to a network. The tnoni- 
tor tttaititains a (talahase of the t>hysiologicid values If* show 
the Iretuls of the patient s status and enable a variety of 
calculalitms t:tf flmg dosage or vpntllatitni and hemodynamic 
f)aranieters= 

Patieni monitors are tised iit hospitals In operatit^g rfjoms, 
iMuergfiMw rfjon^s, and intensive care units and cati he con- 
figured for eve r>' (lalit^nt rat ego ry fadult. periialnr, or neo- 
nate). Very ottett Iht^ tjaiieni atiachetl to a tuouitor is tuu^jn- 
scions and is sustain e<i by otlu4' tued kal dev ic<^s such as 
ventilators, anesthesia niachioes, fluid and drug punips, and 
so OH. These life-^ustainmg devices are interfaced with the 
padent monitor but not controlled from it. 

Safety and reliability re<itiirements for mcMliral devices are 
set very high by iiidiisti^' and reguUilfiiy authorities. There is 
a variety of intertiational ami national standards selling I he 
rules for the deveh*pment, Juarketing, and use of medi<*al 
devices* The legal requirements for electronic medical 
devices are. as far as Uwsv coneent safety, toniparable to 
those for Muelerir jilajits and ah^craft. 

In th{' jiasL llie safety reciuirements eovereil mainly iJie hard- 
wyre aspe* Is cif a devi*n\ siiHi as electromagnet ir rnitipali- 
liility, radio inti'iferenee, electrcjnic jiaiis failurt^ *uttl so on. 
The concern for software safety, aec*enttiated by sonte widely 
known sofiwan* fjulures leading lo patient injury' or death, is 
imiH^asing in the industry ajul the regulatory bodies. This 
conc**ni is atldiesstHl in juany new stantlajxls or fiirectives 
such as the Meihcal Device Directive of the Eiuopean Uruoii 
or the ILS. Pood and Drug Administratitjn, These legul 



requirements go beyond a simple ^-alidation of the product: 
tliey require the maiiufarturer to provide all evidence of 
good engineerijig practices during development and v'ahdii- 
tion. as well the proof that all jrossible hazards ttoni the use 
of the medical instnunent were addressed, resolvexl and 
validated titiring the development phasea 

Tlie dev elo]3nient of the IIP Omnit'are fmiiily of j^atient moni- 
tors started in the mid-i080s. Concern for the testuig of the 
complex safety-critical software to validate the patient man- 
itoi's led to the definitifju of an appropriate testing process 
biased on the iVNSL/]EEE software engineering standards pub- 
lished in the same tinte frame. The testing prot^ess is an itite- 
gral part c>f our quality system niv\ is coruinuonsly improved. 

The Testing Process 

During Ihi^ sijecilltiuirjjis phase of a protluct, extended dis- 
cussions are held by the crossfunclional team fespeciafly 
the UM) and software quality enginet^ring teams) tci assess 
the testing needs, Tliesediseussions lead to a lli'st estimation 
of the test lechnojogy ne^ ded in hII phases of the tleveh>p- 
nient (test technology is ujulerslootl as the set t>f all tt^st 
environrtients and test ttK>is), In the case of HI* patient mon- 
itors the diseusshm staric^l as early as If*8Hand eontjnui's 
with eveiy^ new^ revision of flu* putient iiioiiilor raiiiily. rellii- 
ing and in some cases red** fining the lest teehuology Thns, 
the test environmenl with all Its coitijHaientsanfl da^ tools 
for the functiottaL iutegmtion. systeju, jmd locaJisyiUcm test- 
ing evolved over a jjeritMt of seven years. Fig. 1 illustrates 
I be testing pn»ec*ss iUiil tlu" use f>f tfie tools, 

Tlie lest process stiuls with the lest plan, a document de- 
scribing t]te scope, ajjjtroach, resources, aJifl scheihilc* of the 
intended test acrivities. The test |>lan states thc^ needs for 
test technology (i)atient simulators, signal generattjrs, test 
tools, etc.). This initiates subprocesses to develop or buy the 
necessaiy tools. 

Test design is the docuntentadfHi .specifying the details of 
the te.st apjiroacb atul ith^ntilyingtbe assoeiated tests. We 
follow three majfu' cati^gories of test design for the genera- 
titm of test eases (cjne can consider them ^ls tlie main direc- 
tU>ns of tilt ^ testini^ ap]>roaeh): white h(jx. t>hiek box, and 
risk and baj^;tnl analysis. 

The wbijt* bc^x U'sl design method is for design test, ujiit 
test,, mid iJitegration tests. This test design is tritally logic- 
flriven and aims mainly at patli and dc*cision eoverage. Input 
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Fig. 1. 'V]]v soJUvjirc It^sliiig pro- 
cess toy liV Onuiif 'jjfo piJiJeni 
niuiiitors. 



for tlie test eases comefi fioiii external mid intemjil specifi- 
ralioiLs ((leHign tlocuments}. The test design for jdgorithm 
validation fproor of physio] ogical iiieasureiiieiil algorilhnis) 
follows tlie wlutt' box method, although soiuelimes this is 
veiy difficulty especially for purchased algorillinis. 

The black box test design method is for functionaJ aiid 
system lesL Tliis tt\st design is dala-^hivrn iuid aims at the 
discovtny of functionality Haws by using exhaustive in j Hit 
tesliuf^. Input tor the test cases comes IVoni the cxtciiial 
specific at ions (as perceived by the ciistonier) and the in- 
tended use in a medical eii\dronmeut. 

Risk mid hazard analysis Is actually a gray box methtxl thai 
tries to identify the possible liazarfls horn the intended and 
nniiiTended use of the product thai may be ijotrntiai sources 
of luinri Ibr the patient or the user, and to suggest safeguaixls 
to avoifi sach liaziuxis. Consider, for iiistimce, a noninvasive 
blood pressure jueasurenient device that may oveq^mnp. 
Haz;:iicl analysis is applied to both harrlware (electronic mid 
mechanical) and sotlware. which hit erojje rate and influence 
each other The analysis of e\ ents and conthtions leacUiig to 
a potential h^izard (the mc^thod used is tlie fault tree, a cause- 
and-effect graph) goes through all possible states of the 
tiardware and software. Tlie risk le\el is estinuUed (the lisk 
spectnun goes from (catastrophic to negiigit>le) i)y combining 
the probabihty of occurrence and the impart on health. For 
all states with a risk level higher than negligible, appropriate 
sal^'giiarils are designed. The sateguarcis ciin be haitl in soli 
(or in most crises, a combination of l)0!li). The test cases 
derived IVoiu a hazard analysis aim to test the effectiveness 
of the safeguards^ or to pro^e that a hazardous event cannot 
occur 

Test cases consist of descriptions of actions mid situations, 
input data, and the exjiected output from the object under 
test according to its spec^ifi cat ions. 



Test procefhires are I he detailed instmctions ffU' the setup, 
execulioiL and evaluation of results for ojie or more test 
cases, Inptits for their development are the test <;ases 
(whicli are always environment independent) and the test 
environmem as defined and tiesigned hi the previous phases. 
One can compare the generation and testing of (he tes! pro- 
cedures to the iiT^plementation jihasc of code development. 

Testing or test execution consists of operating a system or 
component under specified conditions mid recorchng the 
1 est I Its. The notion of testing is actually nmeh broader mid 
cmi start very early in the product development life cycle 
%\itli si^ecifK at i<.Jii inspections, design reviews, and so on. 
For this paper we 11 itut Ihe notion of testmg tp the testing 
of code. 

Test evaluation is the report ing of the contents and results 
of testing and incidents that occurred diiring testing that 
need further investigation mid debugging (deled tracidng). 

While test design and the derivation of test procedures are 
done only once (with some feedback tmd rework from tlie 
testing in emiy phases, which is also a test of the rest), test- 
ing and lest evaluation me repeal able steps usually cmiied 
out several times imtil the software reaches its release 
c'riteria. 

Various steps of the testing process also produce test docu- 
mentation, which describes all the plmis for and results of 
testing. Test or validation results are veiy important for doc- 
umentuig the qualit>' of medical i>roducts mid aie reiiuued liy 
tegtilatoiy authorities all over Ihe world before the produt t 
can l)e marketed. 

The regression tesi package is a collection of test procedures 
that can l>e used for seledive retesting of a system or cr>nipo- 
!ient to veiify that nit jdiOcat ions have not caused unintended 
effects mid that the system or component still complies with 
its specified i3roduct mid legal reqthrements. 
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From the Test Plan to the Testware 

Ten or fifieeii yt^an* ago ii was perhm^* f notigh to ^-e a 
medu^al instninieiit to some experr.s for t'lii\ic;il 1 rials, 
aigoriUuii vaiidatioR, aiul it^. The msiruinent had a simple 
display, mfoniiation placemenl was not roiiHgurable, and 
the human interface was restneted to a few^ buttons and 
knobs. All tlie attention was on the tet^Jmit^al realization of 
tiie [utnlical solution (sudi as ECG monitoring), imd softwiire, 
mtiinly written by the ek^rrrit al engineers who designed the 
hardware, was limited to a few hundred statements of low- 
lex'f 1 languages. 

Today the medical instruments are higlUy soptiistif^ated open- 
arcliitecture sjsieiris, with many hundreds of thousands lines 
of code. They aie equipped with complex interfaces to other 
instnmicnis and the world t imagine monitoiing a |iatieni over 
the Internet — a nightmare and a challenge at tlie same time). 
Tliey are networked and can he remotely ot)eralcd. This 
roniiilexily and cijmiectivity requires totally new testing 
approacties, which in many ciises, me n<jt fe^isihle without 
the appropriate tooling, that is, the trsiwarr, 

DisciLssion of the test plan stalls relatively early in the prod- 
uct life cycle and is an exit criterion for the spec ificat ions 
jibLLse. One of the uiajor tasks of the tt*sting approach is the 
assessment of the testing technology' needed. Tlie term lech- 
nolog>' is used here in its narrow memiing of pnjcess plus 
hardware and software tools. 

Tlie lesling tecltntjlogy is rellned in the next phases (design 
aii<l initJlejuejilation) jmd grows ajui matures as the product 
undei develi>pment rakes shajje. On tJie other hiurtJ, die test- 
ing tools Jiiusl he in fikice before the ]7roduct meets its im- 
piementaiion criteria. This means that I bey should be imple- 
ment{*d and validated t>efore the product (or subproducl) is 
submitted IVjr validation. Iliis rcqtiirtnnent illustrates why 
the test techiuilog.y ( iiscus.sk »i should start vei^^ early hi I he 
prochtct lift^ cy<le, attd why I lie t(^stw;ire has a "phase shift 
to tile left" with rt\specl t(j Iht^ pro<luct valitlation phase (set» 
Fig. 2j. 

Test Tool (Testware) Development 

T(?stware fieveio]iinent follows the same product life cycle 
as the produf^t luider develoijaient. The pbasi^s are: 
fieqtiirejnenlsand Define ion Phast*. The test needs arc* 
explained at cording U> tlie test plan and I he high-level test 



de^igiL Alteniaiives are discussed and one is selected by the 

sot\w;uTe qiiaJity ieam. 

• Specifications Phase. The tool is described in as much de- 

t«iil as possil>le to enable the testers to start work with their 
test cases and test procedures as if the tool were already 
availalile. These spedfieations are re%=iewed (or fonnally 

inspected) by tlie product devekjpment ajtd test teams, 
approved, and put under revision control. 

• EJesign and Implementation Phase. EnTi>hasLs is an tiie rapid 
development of engineering prototypes of the tools, w^hicli 
again are the object of review. These prototypes are used by 
the test team f<jr low level lest design and firsi lest trials, 

• Validation Phase. Tlte test tool is valitlated against its speci- 
fications. Tlie most up-bMlate revision of the jjalient moni- 
tor imder development Ls used as a test bed for the tools 
validation. Notice the invei^^ion of roles: ttie product is used 
to lest the test tool! Our ex|ierience show^s that tliis is the 
most fttiitful period for fmduig bugs m botli the tool and 
tlie product. A regi^ssion package for future changes is 

CI eat eft Pirst iKU'dvvare cfmstnietion is started if hardware 
is hivohetl 

• Product if jn PIillsc. The tool is ofllcially re leasts I. hardwcye 
is produced [or bougiit), ^md the tool is nse<.i in the test envl- 
mnment. After some period of tinier when the tool's maturity 
for the test pun>oses has beet) assessed, the tool is made 
public for use by othei- test deiiarinieuis, liy nuirketrng for 
demos, by supf>ort, and so cm. 

Fig. 2 flemonstrates the main difficulty of testware develot)' 
meni: tlie test tool specifications can be createci after the 
product specificafions, but from Ibis point on, all of the test- 
ware developineut pbasi^s should be earlier than the product 
develojimenl i>hases if the prothict is to he validaled in a 
timely niannen 

Besides the shift of the develojimenl phases, thtTe isal.s<» 
the lest w me dilemma: us Uie [jrogress of the tJroduct's di^- 
sigii and tlie test design leads to new perc-eptions of how 
the product ran be ti*sted, new^ oj)i>oili initios or limitations 
appear that were previously unknown, and infliu^nce the 
scojje of the testwaie. The resulting changes in the testware 
innst then be made very quickly, more quickly than the 
clumges in the product. Only the at>j)licatiott of good hard- 
ware iuu] softwfire engiiUH'dng iM'oi t'sses (the tester is also 
a devekiper) can aviiiti having I he wrong lest totjl for the 
product. 
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Another Approach to 1 estiug: 
Inspections 

Inspections have proved to be 3 very powerful white box testing method. 
Inspections are performed in sEI phases of the product life cycle starting 
wjth the product specif icaimns. (Externai specifications inspectiDns are 
an exit criterion for the specifications phase). The inspection goals are 
100% for specifications (exit criterion), 50% for product design [far the 
most critical parts accordrng to the results of the risk and hazard analysts 
and the product architecture), and 20% for code (most critical parts) 

Although there is no defined goaf for test de^^ign inspections, in practice 
■about 75% of test design (high-level and test procedures just before 
automation) is inspected formally. Code inspections are performed by 
two or three engmeers. Specif rcations inspection teams are larger — the 
crossfunctional team as well as REtD exports are participants. The in- 
spection process has the following steps: 

•Kickoff Distribution of inspection material and logging meeting logistics. 

•Logging. All items are logged. Short explanatory discussions are allowed 
(less than three or lour minutes), A moderator and a scribe are always 
assigned to facilitate the lagging meeting, However, there is no chief 
moderator assigned in our laboratory, 

•Follow- up and Rework. This step ensures that all of the fixes and clarrfi- 
cations that were identified as necessary in the logging meeting are 
done and all items are addressed, In an informal message to all partici- 
pants of the logging meeting, all fixes are explained and the reasons for 
the unfixed issues are discussed. 

After extensive training and with massive management support, this 
ir)speciion process works very well and is a fixed part of the product life 
cycle. 



AutoTest 

Tlie l(^Nt teclmolo^v ttsscssmerit forthr patient monitors led 
us to Uie elf velopn lent, of a luiiiil^cr of tools that coiild not he 
found on the market. Tltis make instead of buy dGcision was 
based mainly upoT\ the nature oftlu^ imtienl monitors, which 
have m any CPUs. \ >n j j j ti et aiy t >] i e rat i n g ay st e m.s n 1 1 d n et - 
works, proprietaiy tinman interfaces, true real-time behav- 
ior, a tot of firmware, and a low-level, close-lo-llie-Enaciiine 
programming style. Testing sJiould not be allowed to influ- 
ence tile internal timing of the prodnctr and inv^Lsive testing 
(Imviiig the tests and the oljjects imder test on tlie same 
computer} had to be avoided. 

The first tool developed was AutoTest,^ whit !i atldressed 
the need for a tool able to (1) simulate the patient's situatirjn 
by driving a number of programmable i>arient siniulat oi's, 
(2 ) simiiiate user interactions by chiving a progranunable 
keyjiusher, and V3) log the reaction of the iiistniment under 
test (alainns, pliysir)logieal values, waves, recordings, etc.) 
by taking, on demand, snapshot.^ of tlie information to senci 
to the medical network in a structured manner 

AutoTesf: was fuiliier developed to accept more simulators of 
various parameters imd external non-IIP devices such as ven- 
tOatoi-s and special measurement devices attached to the HP 
patient moniton AutoTest now ceui access all infoimation 
ti-aveling in the internal t)us of the instrument f over a serial 
poll with \}^e medical computer interface) or adchtiona] uifor- 
mat! on sent tcj external apphcations (see article, page 103). 



AutoTest is now able to: 

• Read a test procedure^ aixd interpret the insf ructions \n 
special electronic devices or PCs simulating jjhysk.^logical 
signals 

• Allow user in|)iil for up hj 12 )>atient monitoi^i siinultaneoiksty 
ov£*r different keyi aishers (12 is the niaximimi number of 
RS-232 uiterfares in a PC) 

• Allow user injiiit with context-sensitive keypushing (first 
search for the existence and position of ;in item in a menu 
selecti<jn aiul then activate it) 

• Maintain defined delays and time dependencies between 
various actions and simulate power failure conditions 

• Read the leaction of tlie device undenest (alanns, [jtiysio- 
logical values and waves with all their at tribiiteSs vviiidow 
contents, data packages sent to tlie uetwork, overdi flatus 
of the device, etc.) 

• Drive from one PC slnmlfaneonsly the fe.sls of tijj to four 
palieiit nioMjtors (hat interac t witli each other and e^cchange 
measurement nuntules physically (over a switch box) 

• Kxeeutc t>atch files wilh any combination of test procedures 

• Write to protocol files ail actions (user), simulator connuands 
fit.ir j)hysiologi( at si:gnals (patient), and residts (device under 
test) with the a[jpropriat.e time stamps (wilh one-second 
re.solution). 

AutoCheck 

Tlie succ-ess of AutoTest anrl the huge amount of data [no- 
dut ed as a result of testing quickly led to tJie demand for an 
auLonialed evaluation tool Tlie first thougtits and desires 
were for an expert system that (1) would represent ex7:ilicitly 
(he .specifications of the instnnuent under test and the Riles 
of (ttc I est evaluation, anci (2) would haveaji atiaptive knowl- 
edge base, i'his saltation was abandoned early for a more 
vei^at lie procedural solution named AuliiCheck (see anicle, 
page 103). By using existing eompller-buHdtng knowledge 
we built a toot that: 

• Enables tlie definition of the expectefl results of a test case 
in a formal manner using a high-level language. These for- 
malized exijected results are part of the test jirocedure and 
do(*ument a I the same time the pass-fail criteria. 

• fieads the output of AutoTest containhig the expected and 
actual results of a test 

• t:ompaies dte ext>ecte<i with the actual results. 

• tTMssifies aiid repods die dilTerences according to given 
criteria n\u\ condirions in i^nor llles similar to compiler 
error files, 

AutoCheck hiis created totally new mid remmkabte |>ossibi- 
lities for the ev-aluation of tjests. Huge amounts of test data 
in protocol files (as nmch as 100 niegab>les per day) can be 
evaluated in uuruites wtiere jireviously many engineering 
hours were spent. Tlie danger of overlooking something in 
lengtliy protocols full of nunibei^ ajid juessages Ls elimmated. 
AutoCheck proiides a much more prrjductive approacti I'or 
regression imd local language tests. For Itjcal language tests, 
it even enables automatic translation of the h.im tali /eft t>ass- 
fail c!iteria tluiing n.m time before com}jarison with the 
localized test results (see article, page 109). 
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ATP 

Tlie next step was ttie cievelopmejit of a son of test generator 
tiuH would: 

• Be able to wriie complex test procedures by keeping the 
test desi]^ at the higjiest jKissible level of at>strartioii 

• Eimiji*^ greaUT lef>t co\erage b> being able lo alter the entr>^ 
condiiiuns for each test case 

• EJinihiale the debugging effort for new test procedures by 
using a library' of \^idated test primitives and functions 

• Take art^ount of (he particiiiarities and con Figurations of 
the moiMtors under test by autoiualic seleclion of the test 
primitives for each conriguration 

• Produce (at the same tiaie as the test setii|> and the entiy 
condirions) the necessar>' instructions for automated 
evaluation by AutoCheck. 

The resulting tool is callefl ATP (Automated Test Processor, 
see article, page 95). Like Autf»ChiH*k. ATP was developed 
by using compiler-building technoiogy. 

Results 

Good test design can produce good and reliable manual 
tests. The indusTT>' has had ver\' good experience with sound 
nianiiai tests in the hands of experienced testers. However, 
there is no chmice for manual testing in ceitain areas of 
functionality such as intequ'ocess conmmnication. network 
ccmummicatjon, CPl' loatl, system load, iuid so on, which 
can only be tested wi\l^ the help of tools, (Jur pioct*ss now 
leaves tlie most tedious, repetitive, and resource-intensive 
parts of the testhig prot^ess for the automated test ware: 

• ATP for the generation of test procedures! in a variety of 
ronllgunitions and sellings biused on a high-level test design 

• AiitoTesi for test t^xeciKion, 24 hours a ctay, 7 days a week 
m\b imlinutee! coinbjnHtions of tests and repetitions 

• An I o( heck for th(* automated evaluation of huge amounts 
nric^si proiocol data. 

( inc of the most inten»sting facets is tlie ability of thesi* tools 
toself-docnment their output will; comments, time stamps, 
and so on. Their outjiin can be used without aj\v filtering to 
doeimient the test generation willi pass-fail criteria, lest 
execution whh all exec ution cletails (tesl log), and test eval- 
uation with a c lass ifica lion of the tliscrepancies (warnings, 
cnToi':^, range violations, validity errors, etc. ). 



Automated test ware provides us with reliable, efficient, and 
econoniica! testing c^f difTereni (^imfigtirations or different 
IcK^alizeil versions of a product using the wimc tesi environ- 
ment and the name test prtK'eciures. By RiUowing the two 
directions of f I) automated test ware for fmirtional, sj^tem, 
and regre^ion tests (for better test coverage i and (2) inspec- 
tion of all design, test design, and critical code (as idenrified 
by the hazani atialj'sis), we hav f* achievefl some remarkal>le 
results, as shown in Fig. S. 

Through the years the patient m*>nitor software lias Ix^rome 
more and inon' complex as new nieasuremenis and mter- 
faces were added to meet int reased custouier needs for 
better iUKl more efficient heahhcare. Ah hough the jioftware 
size has grown by a factor of iluee in six years (and with it 
the testing needs), the testing effort, exprt^ssed as the num- 
ber of test cycles times the lesj cycle duration, lias dn*pi>ed 
dnimatically. The number of test cycles has liropped or re- 
mained stable from release to release. 

The predictability of the release flate, or the length of the 
v^ilidalion phase, has improved significanUy. Tliere has been 
no slippage of the shipment release dab* with tlte last four 
releases. 

The raticj of automated tr> mmuial tests is c^onstajitly imji rov- 
ing. A single exc(*ptif m con linns the nile: for one revision, 
la* k of autrmiated ti'.srvv^ut* for the ik*w functionality — a 
motiulo to transfer a patient database from one monitor lo 
another— forced us to do all tests for this function manually, 

Tlie test coverage and tlie coverage of the regression testing 
has improved over the years even though tlie percentage of 
rcgrt\ssion testing in the total testhig ctToi! has constantly 
increased. 

Conclusion 

Sot I wan* qiiality does not star1 Hnd sunAy dtjcs not end wiih 
testini^. Bet ausc tesling. as iht* term is used in fhi.s nilirle. is 
Hpi>lied to the final products cjf a de\eki|>ment ijhase, defect 
ihstovery ihrtnigh testing always Jiappens t(*o tale in each 
phase cjf product development. All the experience gained 
shows tluit tleleci jjrevt niion aciiviiies (Ity af)j)lying die ap- 
propriare const ructisp softw<ut* engineering methods during 
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product, developnienl in aU pha*ies) is more pmdiKiive than 
any analytic c|uality as^iiraiice at the enci of Uie cievelo[jnic*iit 
process. 

N**verthpte!^s, lesting Is the ultiniate JseTUiiie] of a qualify 
assuraiiee sysleni l>efor*^ u produc-r reaches the next ph;us<- 
in its hie cycle. Nolhing can replace good, effective testing 
in (he validation phase before the (jrotlnct leaves RM) \u ga 
tomanufaclunng (ajid la *nir ciistnmcr-8). Even if (his is Iju- 
only and unique lest rycjc^ in this pluLsc (if 1 lie delt>ct preven- 
tion activitiejj produced an eiTorfree prod net, which is still 
a vision], it has to he prepared very carehilly tuid he well dor- 
tiinented. This is especially tnte lor safety-crilieal software, 
for vvhic li, \\\ addition t*i ftnitE if vitality, the erfecliveiiess of 



all safeguards under all possihle failure conditions has to be 
tesie<f 

In this effort, antoinated test ware is cniejal to eti.surin^ reli- 
ability (the I est Wine is (orreei, validated, and does not pro- 
duce false negative results), efficiency (no test renins be- 
cause of test ware prohlerns), and economy foptinnzatj<jn of 
ri'scjurt t^s, especially hnniiui resources J. 

Reference 

l. IX Gimng, ■'Ati Automated te'st Enviroimient for a Medical Patient 
Monitoriii|i System." IlewirU-Pftckat'd JmnmtL Vnf 12, rm. 4, 
flrrot)er l!»91.(}p. 1Em:j2. 
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A High-Level Programming Language 
for Testing Complex Safety-Critical 
Systems 

Dealing with an enornious amount of data is characteristic of validating 
complex and safety-critical software systems. ATR a high-level 
programming language, supports the validation process. In a patient 
monitor test environment it has shown its usefulness and power by 
enabling a dramatic increase in productivity. Its universal character allows 
it to migrate validation scenarios to different products based on other 
architectural paradigms. 

!n Andreas Pi r rung 



This article concentrates on tlie ^specific problem of traiis- 
forniing a test design into concrete aiiton>atic test proce- 
dures. For a systetnatit- oveniew and c^ontext the n^ader 
is refcned to the Liilicle on jjage 89. As descrit>ed in that 
aiticle, r]ie test design idenrifies and ducnnicnls the test set 
lor a gi^'cn ]jroduct. It is derived from external antl internal 
si>eciflcations, software qtiality engineer expertise, and risk 
and hazard mmlyHis results. A lest desit^n Is normally infor- 
nuil and descrilies te8l cases jmd test data on a high, al)s(ract 
level, nuleijendenl of the test environrnenh On the other 
haiui, aji antomalk: test jH'ocefluce has to (h^al with all I he 
detmls of die test enviromneni <ind reflect.s the ahsiraciioti 
cai^ahilities of the existing tools. 



In oiu: softwaii^ tjuaiit>' engineeiing department die automatic 
test enwonmenl is based upon two m^or tools: AutoTest^ 
antl AutoCheck. Tlie fast is a te.st execution t(H>l and the 
latter is iTSiJonsible for wsi e\ id nation (si*i* artit:Ie. t^age KKi). 
AutoTest is \'er^' close to the devices it controls and requires 
detailed comnuuids on a low abstraction leveL AutoCheck 
has to coiM* with the delailetl iow-level utfoimation produced 
by AuKjTest and therefore also requires inimi on a tletailecl 
low abstract ion level (.see Rg. 1). The strengths of the h>w- 
level tuierfares atv their (lexibUity and a(la|>tal>ihiy lo \atioiLs 
different test stt nations. 
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Tlier«^ are soitiediffK-u Hies with the pnyi^e^ showij in Fig. L 
Tlii^ test engineer speiuls <i lot of time tiaiisforjuing test de- 
signs intfj aulomaLit' test iirocediu't^s. Tlierp is a large gap in 
abstraction level betirveen tiie test design and tlie test proce- 
dure. The detail level is low in the test design, but very high 
in the test procedure. It is an enor-prone, time-consuming 
task to bridge this gap maim ally. Because resources are 
always restrictecl the software quality engineer has le.'^s 
time for a more mtensive test design. 

Because the test procedures have a tiigh expiJt it r^'ftunriaiicy, 
it is difficult to maintain and evolve test procedures. Tlie 
ext>licii refhindancy is higli herause AuloTest and Auto- 
C'heck do not suijport dataanrl hmctional abstraction, nor 
do they offer conTi'ol flow elements. A piece of code may 
exist in many copies scattereri over thf test procedures. If 
the test requires a change in die code i)atteiTi, for instance 
becatise of changes in I he timing behavior of the system 
under test (in our case a patient jiionitorj, the test engineer 
Ims to update rmnterous copies of this code pattern. The risk 
of forgetting one |>attern or introducing an err<n^ in a test 
procedure increases with die numlier ot ii[idate steps. It is 
ver>' reason rce-curtsuniing to adapt test procedure.s lo a 
chai^ge in system tjehavion 

The test procedure describes a static test sceniiiio. Tliere- 
fore, the test engineer has to docmiient the test setup conv 
pleU^Iy. Every parameter that inlliiencesthe test enxiron- 
men I and < onseciuently the tc^st cxc^'ution miis! be cai'efuUy 
control ted before starting 1 he automatic test. Our test envi- 
ronment cfjasisis of so many sinmlators, torcing devices, aiul 
sensing devices that sometimes tests need to be repeated 
because the iiuiial ( ondifions are wrong. The ynoblem is tiiat 
tlie automatic te.si procedure descril>es only one specific 
test situation. It ls not ixjssilile to use ]>arameters for the 
test procedure and to feed In the actual stall pai^ameters at 
the l)egitming of the test execution to get more gencial tmd 
roliust test procediu'es. Even asli^^i^t change in (he stai1 con- 
dition may require an adapted or neai ty new test proredure. 

The test coverage is limited because the test data is c^xled 
within die test proct^diu'c. Tiie re]jctition of a test case with 
other test data requires a moditiett duplicate of the test pro- 
cediu'e* Again, it w(jul<.t help if a test case were able to pirjtit 
from data abstraction and parameters, enabling the test 
engineer to fornmlate nuxre general test pjocednres. 

AutoTest and AutoCheek do not support the statistical slmc- 
tiu'al testing approach. It is therefore not jjossible to select 
test data randomly (see "StiTictural Testing, R^uidom Tesihig. 
and Statistical Stnictural Testing" on page 97). 

The tbllowiiig sedion illustrates tJte alu>\e problems by pre- 
senting a practical example to demonstrate tlie transforma- 
tion of a higli-le\T^l test design to an automatic test procedure. 

A Practical Example 

Patient monitors are electronic medical devices used to 
monitor i3h> sioIogicLil paimnetei's of critieall>^ ill patients in 
hitensive cai'e miits or operating rooms. Tliey tdert tiie medi- 
cal staff when physiological parameters exceed preconfig- 
imed limits, hi tliis example, we will concentrate on a well- 
known physiological paianietcr, the lieail rate. The nurses 
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Fig. 2. Hf^arl ratp (HR) akmi ipst jjiIik iple, 

and doctoi"s want to get an immediate alaini when the heait 
rale falls below a given lower limit or exceeds a given higher 
limit, A malfunction oT the monitor may result in the death 
of a patient, so this rurKlionality is safety-criiical and nuLst 
be vahdated wry carefully by the venitor of tlie patient mon- 
itor. The extunple illustrates a test design for heart rate 
altirm testing anti the transrorniati<in proc ess to tlie appro- 
priate automat ic test luocednte. 

Fig, 2 shows the upper and lower alarm hmits for the he^ut 
rate j>ai'amelcr. Tlu' data siuice can be divitied into tliree 
subdomiuns ( t^^uivalence classes): 

• Tlie noniiai heart rate domain — the inter\^al between Lite 
alarm limits. The monitor should not alarm for data points 
taken from this area. 

• Ttie lower alann tlomaiir All data here prrxiuces a low liniit 
alanir 

• The u|)|>er alann doiuain. All (iata here |>roduces a liigli limit 
alarm. 

A ckissic method of testing tlie alarm behavior is to seleet 
representatives hom each of the three areas and check that 
the uHJuitor reacts as expected. Fig. 2 shows tlte selected 
data points aiul tJu^ir aidei in time. 

This graphical representation of the heart rate (IIK) alann 
test leads to the following test design: 

• Test Case 1: 

Action(s): Confignre HR alarm limits to -lO/SO. Apj)ly 
signal HE 45. 
o Expected: Low limit ataiin witi\ text " ■' ■ IlE 45 < W\, 

• Test Case 2: 

Action(s): Apply UK signal 49. 

Expected: Low limit alaiiii remains with text ' ■"'IIR 49 

< 50". 

• Test Case ;J: 

Action (s): Ajjply Hfi signtil 50. 
■ Expected: Low limit alann disappears- 

• Test Case 4: 

o Actio n(s): Select 5 dilferent HU \aiues between 50 and BO 

and apply thenr 
- Expert eil: No HR alarm for each of the ^selected \Tilues. 

These few test cases are enough to demonstrate the prin- 
ciples of test design. Tlie appropriate automatic test proce- 
diu-e description for test case 1 on the AutoTest and Auto- 
Check le\el then looks like: 



96 



J uiK' 1 ! +[iT H I'w iv a- Pat!kiu d Joii n ud 



)Copr. 1949-1998 Hewlett-Packard Co. 



ii ^^^^_,^- . Test Case 1; 

// Adjust alarm limits to 50-SO- 

// 

merlin p^xaiit 

merlin *fiR* 

merlin £2 

raerlin f7 

merlin f3 'a48 

merlin f 6 -iiI2 

merlin f4 -nl4 

merlin f5 -nS 

// 

// Apply EIK signal 4S (normal sinus beat) . 

// 

siml HSB4 5 

// 

// Delay IQ b : after that time the alarm must 

// be announced. 

wait 10 

// 

// Check if low limit alarm "•* HR 45 < 50" is 

// present, 

'*' Verify begin 

*^ Alarm "HR" < al_min; 

// Low alarm active. 

* sound is c__yellow; 

// Limit alarm sound audible, 

^ Verify end 

mecif tune HR 10 

// Tune 10 the HR numeric of ttie patient 

// monitor. 



This exanipic demonslratew ihe (Uffereore in ab??ti-ariirni 
between a test de.sign (higfi abslradion ) mul an automatir 
test |>rf>fe(lure (low abstraction} and gives aii ini])ression of 
ihe tlinifullieii m>led above. Aii autoniatir test dcscriptirm 
language designed to alleviate tliese (Jiffuulties should nffer 
aJ^)Si rati ion capabiiilies lo hide dcMiiilsaiid t<> eonipoj^i'^ rone 
plex rami ions irraii simpler lunrtions. Like evcuy hi|;li-li'vel 
prngraniniing language, it should briclge the abstraction gap 
atitomaHeaUy. In Uie folkmdiij^ scnlion a solution is presented 
tluil iiieels liiese ntH'ds. 

The ATP Program u ling Language 

Ofleit speeifK* [irobU-ius need Imslv piveessors like AutoTest 
or Autoflieek lo piTloJin souu* tjperatior\ stub <ls pushing 
keys, siuuikuing patient signals, simulating powerlail condi- 
tioris, and so on* A stniightforwaifl solution might extent! 
the eominant! Interf^ees of AutoTest aiid Auto (* heck to sup- 
port data and hauiitmal abstraction, provkle c^rmtrol flow 
elenitiits like eojiditions inid lrK)jjs, mid allow hirther ])roha- 
liilistic tlatii general ioiL Tliis would jirobaldy eliminate the 
<)irfi cutties mentioned above. However, reduntlant effort 
would have to bv spvui implementing an abstract command 
iiiterl^ice again and again. 

Our scjlitiion is ATP (Automated Te^t Pmet^lnre), a high-level 
tirogranvming latignage !ba( otfiTs die al^slradion rarilides 
atul makes it (joss i hie to integiate \nis]v [jroeessors smoothly 
ATP allows the integration of ntaity di^erent basic^ proces- 
sors. Si* dic^ ci>orflination of the bcisic processor is much 
easier llian with sejiiimle rontrol. 

The following is a tyjjit al ATP routine rejneseuling the auto- 
rn;ilii Tesj piru edun* tor tlu^ li*';nl rate alarm lest: 



Structural Testing, Random Testing, 
and Statistical Structural Testing 

Random testing ^s one of the more common test sirategtes It does rat 
assume any Itfiowledge of Uie system under test, us specifications, or 
its rntemal design This techniqye is insufficient for validating comptex. 
safBt¥H:ntic3l or missitsn-aitica! software 

The structural testing approach svstemaiically derives the test proce- 
dures from the external and internal specif icsnons Therefore, the term 
test design best describes the mental active ly behind thfs method The 
struciyral testing approach divides the input dam space into subdomains. 
The cnteria for this partitioning dm given by the external specification of 
the system Bach subdomain is an equivalence cfass which is tested by 
choosing some repfesentatives But what if the subdomain is heteroge- 
neous, has unknown Sfde effects, or includes errors if executed in a par- 
ticular order"^ (A heterogeneQus subdomain includes both good and bad 
data points Good means that the system works as specified, whereas 
bad data leads to system failure. For exampfe. m the heart rate alarm 
test described in the accompanying article, the high alarm limit domain 
may contain data points that, when applied to the patient monitor, pro- 
duce no high limit alarm Other data points may behave as expectadl 
Unfortunately, the subdomains are seldom homngenous or dtsjornt. 

Waeselynck 8i Thevenoi-Tosse^ showed that a statistical component has 

to be included to provide a sufficient test data set for a subdomain ^ This 
approach is known as statistical siruciural testing, Our experience has 
shown that this strategy leads to the best results. 

Reference 

1. H. Waesefynck and P Thevenot-FDSse, ''An EKpef<mentation witti Stat^ical 
Testing," Pfac^Bdings of the 2nd European fntsmatmat Conf&mnce on StJftWt^m 
Testing Anafysfs ^ Beview, 1 994 



DEFINE AlarmTest ( IN Patient Size CHECK IN f 
"ADULT", "PEDIATRIC'% ''NEONATE" 

IN Category CHECK IN { "0R'\ "ICO"} 
) 

DESCRlP*riON 
PtmPOSE : 

This routine dejuonst rates spme of the 
ATP features. It is an automatic test 
procedure testing the KR alanri 
capabilities . 

SlGNATtJRE : 

AlarmTest ( <PatientSize> , <CategorY> ) 
EJTD DESCRIPTION 



LOCAL HBValue,/* selected HR Value 
AIj, /* HH low alarm limit 
AH, /* HR high alarm limit 
walk /* repetition counter 



initialize the Repository 



*/ 
*/ 



LINK Repository <- "$PatientMonitorRepository" 
Repository; In it (PatientSize^ Category) 

/* Declare the use of a function 
repository and initialize the 
repository link* This gives 
context "Specific access to all 
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available functions for the 
given Patients! ze and Category, 



SIGNATUKE ; 

RajidomA 1 annTe s t 
END DESCRIPTION 



( <repetitioiis> ) 



* .-^^ Teet Case 1 -' 

Act ion (B): Configure HR Alarm Limit a to 50/50. 

Apply Signal HR 45. 
Expected; Low limit alarm with text 

^'**HR 45<50^'. 



HRrSetAlarmLitnitB (50, 90) 
HRiSimulateValue (45] 
HRiCheckAlann (10, "** HR 4 5 < 50") 

/* Check for limit alarm after 

delay of 10 s. */ 



LOCAL 



* , __„ Test Case 2 

ActiorL(fl) t Apply Signal HR 4 9 

Expected: Low limit alarm remains with text 
"**im 49<50" {alarm string is 
updated without delay} - 



HR; Simulate Value (49) 
HRiCheckAlarm (0, ''** HR 4 9 



AL, 


/* 


low alarm 


limit 


AH, 


/* 


high 


alarm limit 


walk, 










HRValue 














/* Initialize the Repository */ 

LINK Repository <^ "$ Pat ientKoai tor Repository" 
Repository: Init (CHOOSE ( {^' ADULT" , "PEDIATRIC" , 

"NEONATE" } J , 
CHOOSE ( { "OR" , " ICU" } ) 
) 
/* Declare the use of a function 
repository and initialize the 
repository link. Choose 
patient size and category 
randomly. This gives context - 
specific access to all avail- 
able functions for the given 
PatientSize and Category* */ 



50") 



/*_ _ Test Case 3 — — 

Action(s) : Apply Signal HR 50, 
Expected; Low alarm limit disappea.rs, 

^„___ */ 

HRiSimulateValue (50) 
HRiCheckNoAlarm (S) 

/* No HR alarm present after 5s* •/ 



/*- — 

Action(s) ; 

Expected: 



Test Case 4 — 

Select 5 different HR values 
between 50 and 80 and apply them. 
No HR Alarm for each of the 
selected values* 



walk <- 1 

/* Randomly choose some HR values 
in the range 50/90, i-e,, no 
alarm condition exists and 
ttteref ore no HR alarm, must 
be visible and audible. */ 
WHILE walk <= S DO 

HRValue <- RANDOM t50, 80, 1) 
HR:SimulateValue (HRValue) 
HR i CheckKoAl arm ( ) 
walk <- walk + 1 
ENDWHILE 



/. __-__^ -. ... */ 

/* Randomly select valid HR 
alarm limits, then randonily 
select an HH value and check 
if the monitor reacts as 
expected* */ 

walk <- 1 
WHILE walk <= repetitions DO 

HR; Randoms electAlarmLimits (AL^ AH) 

/* randomly select valid alarm 
limits */ 

HK: SetAlarmLimits (AL, AH) 
HRValue c- RANDOM (20, 180, 1) 

/* select HR values between 20 
and lao with step width 1* */ 
HRrSimulateValue (HRValue J 
IF HRValue < AL THEN 

HRiCheckAlarm (10, "•* HR " + HRValue + "<" 
+ AL) 
ELSIF HRValue > AH THEN 

HH:CheckAlann (10, "** HR " + HRValue + ">" 
+ AH) 
ELSE 

HRiCheckNoAlarm (5) 
END IF 

walk <- walk + 1 
ENDWHILE 



END RandomAlarmTest 



END AlannTest 



An autoniaTif !t^s1 ] procedure loi a laiuioni heart rale alaiTii 
test in ATI* nughi lot>k like \bv tuilrming: 

DEFINE RandomAlarmTest { IN repetitions ) 

DESCRIPTIOK 
PURPOSE : 

Random HR Alarm Test 



Even wiiiujut faiuilimiiy wiih thesyiit^ix aiitl semaulies of 
1 lie ATP lajigiiaget it CcUi be recognized thai I he absiraeticm 
level Ls higher than vviih phuu nxle lor the hasic priK^essoi^s 
( hi our case, AiikiTesl and AiitoCheck). It is also worth 
noting thai the automatic test procedures aie nol restricted 
to a specific jjatient monitor. They describe in a ijeneml and 
abstract way a heart tixXq tilann test for any patient monitor 
with a limit alann concepL TIte differences teiween specific 
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patient monitors are in the basic processor interfaces and in 

ihe primitive functions. 

The ATP Concept 

ATP loiLsists of two ni^jor ftinctional elements (see Rg, 3): 
a set of toob to maintain a funciioii repositoi>' and an ATP 
language interpreter, 

Tf^e tools give the user adequate actr» tt. Uit- tunctioii re- 
(icjsitor^'. whicli contains weil~c!ocunientefi, well-tested ATP 
l\mrtions that can he reusccl. This cfTet^rively reduces rctlun- 
danc>^ and increases pr<xlticti\ity (see the next section, 
"Working with the Function Repositon^"). The tools can be 
grouped into: 

• File-oriented tools- Check funetioiis into and out of the 
function repository, compare two versions of a ftineiion, 
etc. 

• Repository qtiery functions- Obtain infonnation about avail- 
able funtiions. abour an interface of a function, eit/. 

• Repositorj^adntinistraTion functions. Administer anfl niain- 
taiu rlie Jitnicture of the repository. ^Ulow archival and re- 
in e\al of the repository. These functions are only aciessible 
l)y the re posit (J r>" adriiinistjator. 

\^lth tjiese tools and a text editor, a programmer writes ATP 
functions by reusing existing funcrions from the reposttoiy. 
Wlien the fimction repository is well-strucrured and tjffers 
rehabk* functions on im adetiuate abstraction level, it is easy 
even for an inexperiencecl t>rcjgranimer to write fnnclions, 
as shown in the example above. 

Interpreter 

The core of the system is the AJP language interpreter. The 

interin eter requires an input file and an output file. The 




input file is an ATP function, hi contrast to other common 
high-level languages, ATP requires one file per runction. One 
ad% aniage of this approach is that each ATP function is exe* 
cntMile. Tltere is no explicit syTilactical distinction between 
a main murine and a subroutine. Any fimction ran l>e tlie exe- 
cution starting point and cim call any other fimction. There 
is no explicit function Merarchy. The liierarchical stnitiure 
is pro\ided independently by the rejiosiiorT stiircture. 

Another ativantage is dial, because each fde contains only 
one fimction. it is nmch easier to administer the fiuictlcins hi 
the function reiM)sitor>. To fiitfrli stmciuial requirements the 
functions can be grouped by any criteria. Tlie heart rate 
example above uses functions tiiat dl belong to the logical 
funttlonai gmup HR. 

The iiitetpreter outpni data c^an be written into an output 
file. A powerful feature of the language Is its a!>ility to inie- 
grBtefhrmai pmressof^^ A lonnat processor is a problern- 
doinain -special fie process (a basic piocessor) that cim be 
integrated iiUo tlie intei^reTation process. In the patient 
monitor lesiing ex ample, AutoTest and AutoC'heck are for- 
mat pn.icess<jrs. The ATP language offers syntactical ele- 
mc^nts to establish a conmiunicaticms channel to a format 
processor so that withui the ATP code the user can send 
any infomiation to tlie processor The format processor can 
send back intViniTation to the ATP inii^ipreter, which then 
fan be sent to another format pnn*ess(jr or logged to the 
ourpiit file. The creation of this AI'P adapter interface is mi 
easy task, thanks to an API that enables a programmer to 
implement this communication interface with ATH If an 
integrated fcmnal iirtKesstjr is general-punjose, it can l)e 
offered to all ATP programmers. A good example is KSH. a 
foiTirat processor that enables ATP prdgrairuners \o inte- 
grate^ Koni shell commands within ATP code. The formal 
processor ('oricej)t ami thealxstradii^n taeiliiles of the ATT* 
langitage iiffer the progranrnrer the means to rmjdel the 
problem domain in an adequate and flexible way. 

Working with the Function Repositorj^ 

The following shorl tnur illnsirah's s(tnie of the tools Miai 
are available to hmitlle tunetioa repfjsiiorie.s. Snpp<)se lliai a 
pr(jgr*immer wants to know which functions in the repository 
are available for dealing with heait rate operatifins. T^iving: 

Liblodex 
Group I "HR" 

will, for example, produce the oiitput: 

CheckAlarm 
Ch ec Jc No A 1 a rm 
SimulateValue 
SetAlarroLimits 
RandomSelectAla rmL imi t b 



To get infomiation about tlie interface of the SetAlarmLimits 
function, the tirogmnrmer ran ty^je: 



FarniBt 

Processor 

Pool 



Tell Me 
Group: "rm" 
Functions: 

and will get: 



'SetAlarmLimits" 



Fig- B, ATP inn€0|>! ovfi-vifivv. 
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**** + + ******** HH;SetAlarTiiLimita ***** + *#***#** 

* purpose: configuire the Heart Rate lower and 

* upper alarm limits. 

# 

* signature; SetAlarmLimits ( <lower alarm ltniit>, 

* <higher alarm liiiLit:>) 



The hpai1 rale fesf exaiiijjie alxjve uses this aiul other furu - 
tions. At the begmning of the l\ni<:Uon a LINK statenieiil 
declaieH mi access jjath to a given function ret)t:jsitcny. Then 
tlie funriioiT reji^ositiny Ls initialized. This initialization fimt - 
tion introdnct^s all available functional groups given a spe- 
cific system context. At thai i;oinl the progianniier is aiile 
to caO the finictioiis, for example HRiSetAJarmLimits, without 
knowing implementation details or physical locations. It is 
possil>le to check out a function for enhancement or mainte- 
nance purposes. It is also possible to check a fund ion into 
the repository so that all test engiiieeis can use the new 
function. 

Format Processors 

Fig. 4 p resell lis the possibilities imd the flexibility^ (jf fonnal 
processotBn Eacli format processor consists of two parts: a 
basic processor and an ATP atkipter. Tht^ basic processoj* is 
a propiielaiy |)ai1, thai is, any ex<:i ubiblt^ eode writ ten by a 
progiammer. Topically I lie basir prcx esjsots are on a low 
abstraction level The ATP adapter is the hit erface to ATI^ 
tliat allows ciata to be sent to ATP and received from it. This 
functionality is encajjsulated and offered as aji API, 

A format processor can be used withm ATP in tJie following 
way: 



FORMAT MyFomiat Processor <- "$MY_FP_EXECUTABIiE" 
/* Declare the use of a format processor. */ 



BEGIN [ My Format Processor ] 



END t MyFormatProoeesor ] 
/* Use the format processor, 
ing inf ormation » */ 



sending and receiv- 



First, a specific syntactical construct introcluced witli the 
kevT^'Oi'd FORMAT is used to declare the use of a format prt> 
cess or. It is then tlie task of ATP to control and to communi- 
cate with tlie foni^at processor. All infoimation enclosed in 
the syntactical bracket BEGiN [MyFarmatProcessDrl and END 
iMyFormatPrDcessDrJ represents a code template for the named 
fomiat processor. ATP generates tlie actual code block frcan 
tills code template by substituting the actual tjarameter val- 
ues for the co<le template parameters. Then this code block 
is sent to the fonnat j^rocessor for immediate execution. The 
fomiat jjrocessor receives the code by railing API bmctions 
pro^itled by the ATP adapter. The protirietary part of the 
format processor processes the received information. The 
format processor can send back hiformation to ATP. ATP 
receives this iufonnation and logs it to the output file or 
redirects it to another fonnat jDiocessor. 

Remote Formal Processor, W a programmer needs to integrate 
a format processor on another machine, for example on a 



PC ninning Windows '-' NT. this can be specified in the FOR- 
MAT cieclaralion. No additional effoit is required for the 
programmer fo establisii a remrHe fonnat [irocessor. The 
adaptation for remote c(jntrol is done by ATP automat i rally 
by adding a distribution adapter. 

Concatenated Forinat Processor. Another feature of the ATP 
language is the abihty to concatenate existing furinal 
processors. 



FORMAT X <- . . , . , 

FORMAT Y < - . . * . . 

FORMAT Z <- X I Y 

/* Concatenate fonnat processors X and Y 

This is similar to UKIX pipes. */ 



to Z. 



BEGIN [ Z ] 



ENTJ [ Z, ] 



The code b]r>ck betweei^ BEGIN \Z] ami END [Z] is lirsE sent lo 
format processor X. The output of fonnat processor X is 
sent to format prfjcessor Y. For fomiat t^'ficf^ssor Y it makes 
no difterent e where the information ( oniey front, that is, the 
concatenation is medial er! by ATP autoniatit ally. Fomiat 
jnocessor Y sends its ontjjuT t)ack to ATP. 

ATP in the Patient Monitor Test En\ironment 

The concepi belviint ATI* Ciised ils inlegralion into (he ija- 
tient monitor test environment, Tlie impetus to develop this 
concept came from om' exi>eiience with tlie lest environ- 
ment, as described at the begiitiiing of this anlcie. But the 
concept is more general. It is not rest ic ted to the j>atient 
monitor test environment . ATP cm\ be used to attack many 
different problems. 



J^ 



ATP 



./^•* 
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Adapter - 



Basic 
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Fij?. I. Fctniiai jirocessor functional blocks. 
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Level of 
OetsiiE 




Failure 
DBScriptian 



AutQinalic Ta^ Envifanmflnt 



Abstraction 
Level 



-^ Manual Task 
=f Automatic Task 



Fig, 5* Current. ATP integi^tion 
in tjip patjent monitor test, envn- 
romnenl (phase J). 



The iniegmtion c»f such a tool liito an existing enviroimient 
Is a challenging task, ATP, like eveiy tool, requiies some 
effoit to build up the jiecessaiy infrastructure, to support 
the tool, iinrl to Ipiini the new language. A step-by-step, three- 
phase integration of ATP in the test process was planned. 

Phase L Deveiop apatieTvi'inonitor-relevant repository' strut - 
ture iJiai, is easy to use and maintain. 1b parallel, implenienr 
a set of primitive functions to fill the repository. Tlien test 
the structure on new patient monitor fiuictionality. In this 
phase ATP {kn^a not specify ajiy format pr(K-cssor execut- 
able, that is, AuioTesi and AutoC'hetk cu-o not integrated as 
format processors. ATP wiites ttie code block inmiediately to 
the output file. Tlie generated code is then processed in a 
postprocessing step. 

Phase II. Enhimce AutoTest and AutoClieck wiili ATPadat>t- 
ersso thai they can be integrated as forjuat processoiii. 
Then the same fitnctlons used in (ilvase I can l>e executed 
ininicdiately wiUtout dte postpnjcessing steps needed in 
phase L The test tools are invoked by ATP- 

Phase III. Complete the function repository^ and niigi'ate, step 
by step, the existing test package to ATP functions. Then the 
testing package can be used agtun for new patient monitor 
produci^s simply by rephicing the fonnat j>rocessors by new 
ones and by substituting some primitive futvctiorus. 

Currently the phase I integration is completed (see Fig. 5). 
A repository structure lias been proposed and evaluated iti 
some projects. Iri these jjrojects lest engineers use ATP to 
automate the tests, ATP generates AutoTest and AutoCheck 
(X)de, wliich is then piissed to AutoTest tuid AutoCheck for 
t^xecutioti. F'or phase 11 integration, otrly the declaration of 
the AutoTest mid AutoCheck format piocessors wili change. 



They will then specif mtegratable AutoTest and AutoCheck 
format processors. This phase is currently in progress. 
Phase in has i>een staried. 

ATP Integration in Phase 1: An Example 

'fhe following ATP liuiclion illustrates haw ATP generates 
AutoCheck code. This function is the CheckAlarm funcUom 
caileci in the heail rate alarm test used in the example 
presented earlier. 

BBFINE CheckAlarm ( IN AlarmDelay TYPE IN 

IN AlarmString TYPE IN 
{"STRING"} 
) 

DESCRIPTION 
PURPOSE : 

Check if alarm is present after a specified 
delay. 

SIGNATURE : 

CheckAlam { <AlarmDelay>> <AlarjaString> ) 

END DESCHIPTION 

FORMAT AutoTest <- " " 

FORMAT AutoCheck <- " " 

/* At the moment AutoTest and AutoCheck 
a.re not really fonnat proceBsorg* The 
declaration of AutoTeot and AutoCheck 
doee not epeclfy any format processor 
executable, in this case ATP writes 
the code block immediatly to the 
output channel, i»e, ATP generates 
AutoTest /AutoCheck code. */ 
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LOCAL sound 



IF 



^" == Alarms tring [1, 3] THEN 
/* Is it a red alarm? */ 

Bound <- "red" 

ELSE 

/* NO ==> yellow alarm */ 

sound <- "c_yellow'' 

END IF 

/* Very critical alarms are announced as 
red alarms whereas less critical 
alarms are announced as yellow alarma. 
Parallel to the visible colored alarm 
string a corresponding sound is 
audible, i.e. for red alarms a red 
alarm sound is audible and for yellow 
alarms a c yellow alarm sound is 
audible. */ 

BEGIN [ AutoCheck ] 
^ Verify begin 
^ Alarm & f 1, AlarmString) within 

f<^f l^AlarmDelay) ,NaN) ^ 
* sound is @(1, sound); 
^ Verify end 
END [ AutoCheck ] 

/* AutoCtieck code generation* 

The code template includes ATP 
variables, which will be evaluated 
at run time. */ 

END CheckAlaon 



AlihoiigJi I lie tiifreiit ATP integration m only the first phase, 
ATP has proved lu be k\ poweifiil fool Tor HlUiekin^? aiicl solv- 
ing coiTiplex test it jg problems tliat iniuu'wise woultl nut have 
been solved in thie same time? frame. Like every new tool, at 
the begum ing sornf" eflort is ieqitire»rl to ieiuri the languagtv 
jVlsfj, the test engineers liave to implement a ser of ininiitive 
functions to build a pow<-r1ul limetion rejiository. Neveillie- 
less, ( HJi" experience has sltown that productKity increased 
significantly and that ATP hel|>etl to ensure the preciictaljilily 
of product releases. 

After a few days of nse tfie lesl engineers felt comfortable 
enougli lo ilevelojj their first automatic tests with ATP and 
were able to us^ the ftmction repository. 

Tbsts aie nmch more so]>histicated mu\ effective dian before. 
Tlie same tests written directly in Autotesl/AiitoClieclc code 
would have probably required tlu-ee times more development 
time without reaching the simie degree of rehaljility* flexibil- 
ily, ajid nuuntainaliiiity. ll^e test en^^neei-s using ATP iisert 
tiie increased productivity to iJiink about better test designs. 



Failures have been found earlier because of higher lest cover- 
age, especially from the use of rm\dom test data feneration. 
These failures would not liave been detected in the valida- 
tion phase widi Ihe existing static test. The risk ul missing a 
faihii'e is Iherefcire induced by ATP. 

The reduiidancy of the tests is nnu li lower. Test engineers 
are now able to adapt their test procechires rapidly to 
chimged system behaviot: In inf>st (tases tiiey just have to 
u[iriate some c:oiLstantB. 

The higlier abstraction level of tlie test procedures enables 
the test eugintvr to use the same test fnocedtnvs to test 
new patient monitor products. Tlie adaptation requires the 
substiftition of some low-level primitive functions and the 
format processoi's. 

Implenientaticin 

Tlie ATP iriteii Hitler is implemented in C on a workstation 
nmning the HP-L'X='= operatmg system. Most of die leiujsitory 
tools have bet^n wriltcMi in the ATP language itself. This illus- 
trates iliat Al'Pis not only a language for tonimiating lest 
procedures. 

Tlie architecture follows the classical compiler architectiue, 
Tlu^ front end with lexical inialysis, s^nUictical mialysis, and 
semantic analysis Ls similai' to other compilers for high-level 
formal languages. The back kmi\ consis(s n\ Mie code genera- 
lion rnudLde tmd the conmiLmitalitMi module, which mmiages 
the format processor communication and other fimctions. 

C0m-]iisioii 

The ni'w ATP kmguage bridges the gap between high-level 
test design luul low-level antomatic test piocediires. Tlie 
integration of ATP hito the test environment has increased 
productivity and reduced rednndancy More imp<u'Umf, the 
quality of the testing process has increased wiih the use of 
tills al>slract high-level programming language. Migration of 
the test iirocedure set to new^ products is now nmcli easier 
beianse most of I he code ctm be leiised. 
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An Automated Test Evaluation Tool 



The AutoCheck program fully automates tiie evaluation of test protocol 
files for medical patient monitors. The AutoCheck output documents that 
the evaluation has been carried out and presents the results of the 
evaluation. 

by Jorg Schwering 



Tlie AutoCheck prqgrBm extends the autontated tesi enviroiv 
ment descTibecl in this journal in lf>91 J It fu!ly automates the 
e\^ualion of Uie tesi protocol files genemted by tiie AuioTest 
pnograiB, 

Fig. 1 ma brief smiimar>^ of the 1991 Siystetti. The main part 
of the figure shows the main elements of tlie AutoTest pro- 
giam aitd lis environment. The AutoTest program reads and 
jnferi>rets commands line-by-line out of a test script fa test 
procedure). Basically there are thiee tyi>es *jf commajids: 



• Commands to siniiilate user input on the monitor under test 
fke>pusher) 

• Commands to control the signal simulators, which play the 
role of a critically ill patient 

• Commancis to request and log output data from the monitor 
under test (reactions to keypresses and signals applied). 

The execution of aU commands and the data received from 
the monitor are logged into a protocol file. Fig. 2 is an exam- 
ple of a test protocol file (one test case only). 
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08:S7 
08157 

{ 06:57: 

09:57 

09:58 
08:59 
08:59 
D9;00 
09:00 
09:00 
09:00 
Q9;00 
09:00^ 
QS :QQ: 
09:00; 

09:00: 
09:00; 

Tuned 

09:00: 
09:00: 
09:00: 

09:00: 

09:00: 

09:00: 

ADATl; 

(a 

09:00: 

09:00: 

09:00: 

ADATl: 

09:00; 



35 

35 

35 

!35 

37 

:^9 

: 3 3 

:2S 

;35 

;15 

:%¥! 

:15 

:1S 

:35 

;3S, 

t3(S 

:35 



//^ ^. „ 

// Te^Bt case 6) "^ — 

//Adjust HR alarm limlta to 50^80 
Berlin param 



Merlin f7 
Berlin f3 -n4e 
Merlin f6 -n4 9 / 
Merlin f 4 -n7^' 
Merlin ffS" -ia34 
// Set HR to 120 



sial 

// delay 

wait 3?^^^ 

/ / ? ? ? ? ZJ- ^ ? ^ ? ? ? ? -2-£2? ???????? 7? ??????7???7?7?7???7???????????????^??? 

V'/ Verify that: 

// HR = 120; 

/y alarm "HR 120 > 80^ i. 



35 //_ ?Z?7:?7 7777??:2! 2J*?2J^?? ? ?7 ???? ????????????? 77? ?7?7??7?? 77 ?????????? 
Bljcnecif tune HR ECG-CHl 



HR -WJ, ECG-CHl "WS, 



36 A "** HR 120 > eo 
0.6/ 



3^1 HF 120 V 



Extreme BraHy: 



"OR 33 ECG-CHl -WS p= ovY- 

2752 1492 mVL -ps- 

"^^^^ SD'^ /SO 30 /250 

20 



40 



liCS^CHlT 



p= ovY- /min aji 



3T].A^** HR laO^ ^j^^-.y-'^'^rg 33 ECG-CHl -WS p=---ovY- 40 4 

38; (Wim 16/ 27 52 1492 mV p = o--- ECG-CHl 

38; HF 120 50 /SO 30 /250 p=---ovy- /min HR 
"Extreme Brady: "0 20 



3a A "** HR 12 > 80 
39 (Wl) II le/ 
39 liF- 120 



OR 3 3 ECG-CHl -WS p-^ ovY~ 40 4 

2752 1492 mV p^-^_o--- ECG-CHl 
50 /eo 30 /250 p- qvY- /min HR 
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Tlm« Stamp 



Keypusher coFiimands to 
s«t a Earn Eimits m 50-BO 



Simulator command in set 
heartralB \u 12Q 
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40 EOT 







2 



09:00:40 // 



— - Alarm sidng 



Data btocjcs read 



Besides the alarm string and 
the HR numenCr much related 
data such as units arthe 
alarm limits is received. Most 
of those attributes should also 
be checked. 



Fig, 2. All example of a tost protocol file} Lor one tesL case. 

The upper left block in Fig. 1 indicates how the test scripts 
are derived from the product specifications. More informa- 
tion on the testing pjocess can be foiuid in tfie article on 
page B9. A test script consists of a staling) configuration 
block, which con figures the monitor to a defined startup 
condition, and the test esses. Each test case consists of 
three paits: 

• The actions (kejpusher and simulator conmiands) 

• The descrijition of t he expected results 

• The AiitoTest data request commands. 

The upper right comer in Fig, 1 shows the automatic evalua- 
tion tool» which is the subject of this article. 

Manual Evaluation 

In the test en^ironmeivt of Fig. 1, the test engineer had a tool 
that ran setjuences of lest scripts in batch mode, completely 
unattended, overnight or on weekends, Witli the mtroduction 
of AutoTest, the main effoil for tlie tes! engu^eer shifted from 
test execurion to test evaluation, which was done wiOi tiie 
help of an editor. The protocol files generated by AutoTest 



(see Fig. 2) are ASCII text files that iire veiy often laiger 
than one megabyte (some large tests have reached a size of 
more than 100 megabytes). 

The evaluation task was not only tedious and tmie-consuniing 
but also error-prone and dependent on the ki^owledge, expe- 
rience, cind even tiie mental alertness of the evaluator. As a 
consequence, the manual checks were restricted to a mini- 
mum for each test case^ which tyijicaHy meant tiiat unex- 
pected attributes were not detected. Furthermore, for tests 
that needed to be rejieated. tJie e\'al nation was nonnaUy 
restricted to a few (one to tlnee) selected repetirions. 
Statistical tests, such as actjustmg an alarm limit randomly 
and checking the alarm string, generate particularly large 
protocol files that are difficult to evaluate niaivualiy leading 
the test et\gineer to reduce the number of test cases to a 
minimum. 
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Statentfim 



Error Me^ssag^s 



Evaluariofi Check 



Fig. 3. Tiic use nvcidel for Aulo- 
1 -ht* k r*'plat f5 niiiniul eval!.mti(fM 
vvirli tluvaiEloinatic yvaJuation Tool. 



Goals for an Automatic- Test EvaJiiation Tool 

Betaiisi^ of th(*!50 |>roblenLs, £ui invt*stigaLicHi wits started on 
a \tm] I hat couki replace the huoiiui evahiator. The goals for 
thisautonmlic^ pvaluatioii tool were: 

• To relieve the tesl engineer of tedious, time-consutnmg 
maiHia! evahtation ami thereby increase efficiency 

• To a vo i ( iO \ e rl t >o ki ng ( I i nn ej^a n ci es 

• To gel the tei:!l results faster by quicker evaJ nation 

• To increase test coverage through side-effect checks 

• To make evaJii;ition more objective (not test erf le}Km<lej^( ) 

• To aik>w rondjliojial checks (tlexitaliiy) 

• To aiitoniate loeal kuiguage regression tests (see article, 
page lOJV). 

Use Model 

The basic use model (see Hg. 2) is the replacemeiil (trinanna! 
e\-i^Matioti wUli the anioniiitit^ ev;iJnaiiou looL The evaluation 
of ifit^ jjroiocol nU' nms after ilie test hits Rnisht^d. The tt^si 
files already coniain liie expected restilts coded in a lonnal 
lajiguage re*t<lable by Aii(oChe<'k. 

Test exerution and evaluation now ronsisls of Ihe following 
steps: 

1. Wriie I he AufoTesi testscTipi m chiding the expected 
results in Autotheck fonnat. The hiisic tt^st scrip! layout as 
described al>ove stays ihe smite, llw otdy diffeiences jue 
(hat some AutoChe<*k definhions sucb as tcilerances (see 
■;Aut<jf'heck PVat tires and Syntax" below) aiv added to 
(he Stan up hUK'k and liiat I tie (JescriptJon ofitie expected 
results has to follow tJie AutoCheck fomiab 

2. Run this test sc ript through the AutoCheck syntax check 
to avoid ust^less AtitoTesl rujis. 

'•}. Exetitle ihe test script with Autt)Tesi as ttsual. The ex- 
peetect results ( AutoCbeck statements) are treated by Aiilo- 
Test as c*onimeiits. whicti means fhal ihey are only copied 
into the proiotoi file to|J(iher wiiJi a finie staniii, 

4. Run Ihe protocol file through tlie AutoCheck evaluation 
check, which includes a synlax <heck. Autorhcck generates 
a diff file repotting the tle\ iatioiis from tlic* expected results 



( errors) ajtd wamingji for everything that couldn't lie e%^alu- 
ated or is stispicions m smwe other w^ay (for details see 
"AutoCht'ck Output " bekjw). 

5. If and only if AutoCheck reports errors or warnings, 
check the [irotocol fik^ to find tmt whether thc^ fle\iation is 
catised by a Oaw in the test siiiiif or a bug in Ihe patient 
nionitor under test. 

Arc hi tee til re 

We first conducted a feasibiUty study, wiiich investigated 
different aicliitectural approaclies and implementation tools. 
The first approach was in the area of aitificial intelligence, 
namely expert systems and language rcrognitiou fthis would 
tie expecterl for an iuvcstigaliiai stalled in IMiU). h soon 
bei-ame ai>parent that protijcol file evaluation is l^tisically 
'd corniiiier problem, llie kmguages and tools investigated 
w(Mc f*rolo.i:/Lisp. seM'NlX" shelf [ex/yacc. i\ and a ( -style 
macro language for a [jrograniiualile editor. We cauu^ to tlu^ 
cone his ion that a combination oflex/yacc ajul V would lead 
to the easiest and most flexible solulion, 

Fig. 4 shows the AutoCTtec^k architec^ture. The pi'otocol file 
is first nm through a |iret>rocessor, which removes all lines 
inelevant to AutoCheck. identifies the ckfferent AutoTest 
intei^ices. and perfonns the local language translaticms, 
Tlieieafter it Ls aiudyzed l>y a c<mibuiation of a scanner and 
a parser. We implemented specialized scanner/parse i*s for 
the /\ntoChe<-k metakuiguage and the data providetl by tke 
different jjalitvut mc mi tor interfaces. The Anbj( lierk state- 
ments and the AutoTest data are wiitten into septiraie tiata 
structures. A third data slni(Hure holtls some control param- 
eters such as the accepted tolerances (see "AutoCheck 
Features mid Syntax*' below). After each data package ^ 
wiiicb is the answer to (jue .XiMoTest data request t*ommanrl^ 
the compare function is started. Tlie compare ftmction 
wTites all deviations into the enor file*. 

Basically, AutoTest and AuloClieck iecogni/.e two types of 
data nNjuests: f?liigie hnms, which resfjond with exactly one 
data set lor vnvh retiut^sted niessage, atitl (fiftflnnoiis hfncs, 
whirh gal lit ^r data over a defined time inienal 
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Fig. 4. Aut^Cheek arc! litec tare. 

Jji the monitor lainiiy luider test all importaj^t data haa an 
update frequency of 1024 ms. AutoTest groups all data mes- 
sages received within a 1024-ms frame into one data hlook 
and AiitoCheck treats each data block of a continuous tutte 
like a data package of a single tune* 

AH AutoCheck stateruenls ai'c then verified against each 
data block. The AutoCheck statements remain valifl mid 
are applied to the next data block untO they are exphcitly 
cleared or overwritten by a new AutoCheck block. 

AutoCheck Features and Syntax 

The AutoCheck features and syntax are best described using 
the example shown in Fig. 5. The numbers below corre- 
spond to ttie numbers in the left, column of Fig. 5. 

L All AutoCheck slateinents are preceded by a caret (^) and 
are treated as comments by AutoTest. As mentioned above 
under "Architect lire," AutoCheck statements aie giouped 
into blocks. Eacli block is enclosed by tlie tw^o statements 
Verify Begin and Verify End. 

2. There Is a set of AutoCheck statements that enables the 
user to veiift^ all data tliat can be read by AutoTest (ruini erics, 
alerts, sound, wave data, task window texts, etc.). Au exam- 
ple of a niunerical %^alue is tempera! lu^e, including all of its 
accompanying attributes such as units (^C) and alann limits. 
In this exa[uple Ihe value of the temperature nimieric Ls 
expected to be 37.0'C, 

3, Verify statements can be combined with: 

• A negation, for example to clieck the absence of an alann 

• Timing conditions, for example to verify that an alaiin delay 
is within its specified range. 



In this example it is expected that in the time mterv^aJ from 
6 seconds to uifinity ( NalM) there is no alarm for blood pres- 
sure. Tliis is a typical test case in which tiiere was an alann 
and the simulated measurement has beei^ reset between the 
aiami limits, the object being to check that, the alaim disap- 
pears within a defined time. 

4. For all t\unierical values (nieasurenTeuts), including those 
ill the al;irm string, a tolerance can be defined to compen- 
sate for simulator tolerances. Tiie I olerances are defined 
outside the Verity block in an additional block. ii\ltliough the 
user can change the tolerances as often as tlesired, they are 
tyiiically defined once at the beginning of a tost procedure 
and then used for the whole test procedure. In this example, 
all values in the range from 1% below 37-0°C to 1% above 
37.0"C (tm.7 to 37.3°C} would be accepted as correct for tlie 
Tempi parameter. 

5. There are special combinations, such as a numeric value 
and an alarm string- For instance, in the monitor family 
under test an alarm message typically indicates that tiie 
alarm limit has been exceeded. Tl\e alann lijrdt is also in- 
cluded in a numeric message along ^ith its attrjl)utes. The 
command alarm "HR" > al_max allows the tester to compaie the 
alann lin\it In the alann message with the alann limit in the 
numeric message (as opposed to checking both messages 
against a fixed hmit). Tlus feature is mainly useful for statis- 
tical tests. 

6. Simple control structiu-es (it and, or) can be iLsed to define 
different expected results for conditions that are either not 
controllable by the test environment or are deliberately not 
exphcitly set to expanrl tlie test coverage. In the monitor 
fmiiily under test some settings are dependent on the config- 
uration (e.g., patient size). The simple control structures 
allow configuration-dependent evaluation 

7. As a condition iti m\ if statement, either flags, wliich have 
to be defined eailier in the test procedure, or an ordhiary 
AutoCheck statement can be used* 



(4J A Tolerance Definition 

(4) '^ "Tempi" ; 1%; 

(4) ^ End Tolerance Definition 

f1) ^ Verify Begin 

j2){4J ^ "Temp 1 " - > value = 37.0; 

^2) ^ "Teinpl'^->iinit = C; 

{3) ^ not alarm for "Pressl" 

{3) ^ within (S^NaN); 

{5} *- alarm "HR" > al max; 

(6H7| ^ if Neonate 

(6^ ^ then 

(6} ^ "HR"-> al_min = 30? 

(6) ^ endif; 

(6K7) * if value of "Pat. Size" is "Adult" 

j6) ^ then 

(6) ^ "HR"-> al_iRin = 15; 

16) ^ endif; 

(3) '^ write "Check user input"; 

(1) ^ Verify Bud) 

Fig. 5. An exaniple of expected results UTitten in die AutoCheck 

language. The numbers at the left refer to the pardgrapiis in ihe 

article that describe these statements. 
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8. AiitoCherk prcmtles a coniniaiul to write a rtjniinc^il into 
the output tile. This caii be iisotl lo iiiHtiiHl the user^ \o cherk 
something manually (e.g., a iiscT iTipitt string). 

AutoCheclc Output 

Fig. 6 is ail example of AutoCheck oiitpul. .\iiioi hc^ck gener^- 
afesse%eri flitlerent out](Lil types: 

• Kvaliiatinit Kmtr. The ex pec led (taia an<l IIh^ recciveti (lata 
(hyn'l malt 1l 

• Evaltiation Warning. Auiut'lieck ecniklnl tIcltMiuiiu* wht'lher 
tiit^ ciaia is eorreet (e,g,, data missing). 

• AiUfjTest Knor. Errors reporterl by AiiloTesI are niirrorefl 
in the oulpnt llle \o make them vlsililc* to Ihe lei>t engineer, 
who only l(j«ks at the protocoi file in caise of reported 
einjrs, 

• SjTitax E\rm\ The int.eq)retation «irth(* AutoCheck s^yntijx 
failetl. 

• Syntiix Wanving. The AutoC'heek syntax could be inlerprt^ted, 
but is sii.%peeted to be incomplete or wr'oiig. 

• Data Error, TJie AutoTest data roukin't b«^ inlerj^reted eor- 
reetly. This inihrait'S either a rornipfeil imWcKnl file or an 
incnm|jalit>ihty of AiUot heck and AuIoTpsi vt^r.sions. 

• Writt>. This ih a user-defmed output, tl eiiatjh'B the irser to 
mark data that should be checked manually, sueli as user 
input at a pause statement. 

Ttu' user can clioosc tietween four different warning hovels 
for the syniax ant! cvahuiMoji Wrunings and can swikh 
mdividujiJ warnings on or ntr. 



Tlic oiitpnT i^<Mu rated by AutoCheck htis the fonowing format: 

ErrorType : statement line, datallne > 
descriptive text 

Thus, boih I he line eontaining the AutoCheck sfatement and 
the line containing the data are indicated. 

If the output is writtt^n inio a file, each line is pnn-eded by 
llu* fHenameisiatementlineL This is tire same foniiat jls used by 
many conrpilers, ajul therefore the built-in tuacros of many 
jHograonning editors i an be used in cf>mbi nation with Autf^ 
ClK^rk. This rnciuis ihal erriMs can h<^ rt^vic^wed in niucli the 
same way that a source file is debujE4jfed alter compilation 
using an editor pointing to tiie source code errors. 

At the end of the evaluation* AutoChec^k gives the test engi- 
neer a tjuick overview^ of the result by providing a table 
showit^g how^ many output messages of eacli tyiie have lieen 
gentvrated. Whereas the cvahialion errors indicate bugs 
ejtlier hi the product or in the test scrijit. Hie otlier outi>ui 
messages indicate potential pnjbleius in the test execution 
or evalnatitJU process. 

The AutoCheck output documents botjt tliat the evaluation 
bus ticen canied out and die result cjf Ihc {^vahiation, which 
tor lucdical prothiclsare Jm[iortmit for jegulalnry approvals 
aiif) audi Is. 
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Platforms 

AutollHH k and Aiifotpst run on different platfontis. Auu>- 
Tesl runs on a DOS " -based PC', which is \-ei'y appropriate as 
a test fontroller hecaust* of the inexpensive hardware, an 
operating system that doesn'l rrquhe dealing witli tasl<ing 
eonnicls, anri tlio availability of interi'are cards (tfie iiiter- 
fare to Mu^ nieriif-al niHworlc is availal:)le as a PC" card mdy ), 
AuloC heek mns on vi UN'IX-luLsod worl<station beeaiise of 
(he avail ai)ility of lex/yacc and the greater resourres (rn(*mory 
and processing power). However, both tools work on the 
same file system (a LiNlX-liased LAN server). The user 
d f J t \sn t tin \' e to wo i r y n t> o u I t h e d if f e r en I fi 1 e f f >r 1 1 la I s , 
because AutoCheck automatically takes care of the formal 
conversions. It accepts both DOS and UNIX formats and 
gemnales the output aecordiiig to the detected iirotocol file 
hmuHL Having dillen'nl machines for execution and evalua- 
tiiMi has also not proved to be a disadvantage for the lest 
engineer. 

E xpanda bility 

The basic architecture of AutoCheck has proven to be flex- 
ible for enhait cements over time. Since the first release of 
AutoCheck w^e have implemented many enhancements 
because of new^ product fcalmes and because AutoTest 
provides additional data stnu'tures. 

Validation 

The risk of the AutoCheck approach is that, if AutaCheck 
overlooks an error (false negative output ). the testt^r won*t 
find the error An automatic evahiadon tool is only useful Lf 
the tester can rely on it, since othei'W'ise^ even if no errors 
were rcpoiled, the tester would still have to look at the pro- 
ttKol file. Therefore, the validalion of an automatic evalua- 
tion tool \s cruciai W the success of such a tool. Frtr this 
reason a ilujrougli tesi of the tool was designed and every 
new revision is regressifin testeti Clumges and enhancenient.s 
undergo a format jjj ocess slmiliu^ to ttiai iise<) for rusi tjmer 
products. 



Results 

Tlie majuial evduation time for an ovoniigbi rest of arfjund 
one to two hours has been reduced by the use of AutoCfierk 
to less ihmi a minute. This means that the additional efforl 
for the test etigineer for writing the expected results in the 
AutoCheck syntax is compensated after tliree to five test 
runs. This rh^penrls oil die ext^enenr (^ (jf the test er^gineer 
wiib AulnCiieck Uhe nonnai leaiaing curve) mni lite nalure 
of the test 

A positive side effect is that it is much easier for another 
test engineer to evaluate the test. 

AutoCheck also leads to bigger tests v^lth ;in Inereiised man- 
ber of checks for each test case, such as checks for side 
efft^cts. Such an automatic evaluation tool is also a prerequi- 
site forstatislital testing. It would t<rke too nnich time to 
evaluate all ihest^ test cases maim ally, In otlu^r words, Atito- 
C-heck leads to liiglier test coverage with lower effort tor the 
test engineer 

( )nce 1 elieved of a great deal of the tiiore mechanicaJ test 
execution ;uid t ^valuation uciivities. I he <esl engineer has 
time U> work on lu'w anxi belter ies( ajjproarhes or possibUi- 
ties for an iiTcreased autonmtion level. Over time tlris has led 
to enhuncements of both AntoTest and Antoflieck and to 
new inols hke ATI M see ailicie, page Jl-^i). 
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Effective Testing of Localized 
Software 



Testing localized software is a complex and time-consunniog task. With 
the help of the testing tools developed for HP patient monitors, local 
language validation for these products is fufly automated. 

by Evatigelos NLkolaropoulos, Jorg Schwering, and Andreas Piming 



I^jcaHzalion plays a vriy imiinriaT]! rolt^ in the successful 
market iiig of software all o\'er the world. For niecliral de- 
vice there are legal requirements to provide instninients 
and aef(jni]jan\ingcltJcimientalH>i^ in live language oFiIk* 
heal tin "a re pemonnei wtio use llieni (as is ihe rase in ihe 
Etnopean I'nion). Il is often forgotten that localiy.ed soO^ 
waie is liijfi'reui software from the original ( mosit pri>l>al)ly 
m English) that was tL*jed for system integration aiid final 
validation. Localized software undergoes a proper irttegra- 
tloti eyele (intt^gralion of stjHware <'tnd fratislaied strings) 
;uul must he validate<i seiiarately. The comi>lexity of this 
validation is ohvious if cme considers the efftnts required 
to check all enor ronilitions aj\d the conf,s|)onding enTjr 
messages (antl tr> understanfl them) for software in e\'er>' 
language w'here the proiluct is mai ketcnl. 

Thf^ tnost eontmon eiiors iit k)cdized software. assuming 
\\vM live Etanslation is done hy a professional translator for 
this hmgnage ;md is correct, are: 

• Missing stnngs femt>ty messages, (jarts of screen text 
missinj^. nu-nu select iun items missing) 

• Htrings wilh wrong altrihutes (maximum length exceeded — 
a possible erash cause) or sUaiige chaiacters fill in j* uii the 
re* m ai ndi ^ r of a fi e I d 

• Wron^ si rings (not reflecting the intentions of the author 
for this p arl i f ■ u 1 : u t ' oj i text ) 



' Various inisspelliugs or violations of grammar mie-s apjilied 
to the language procki ceil through the conthination of traits- 
lale<l .strings by ihe software 

' Strings tiot t>iot>erly cleared in a text field before a new 
stiiitg is {lisplayetl 

Local language tesiit^g in onr lal)oratoi^ is composed of two 
steps: the verificaf ion of the tnuLslation and the v^alidation 
(regressioJi testing J of the loeaii:«ed sofhvare. 

To verify tlie translation, a translator goes over till possible 
screens, messages, help texts, ijrintouts, and so on to check 
for translation errors. Tlie difficulty here is that in most 
cases the tran^siatoi' is nol a fre<ntenl user of I he device 
under lest, and needs a.ssisTance in f operating ihe medical 
instrument atid generating all pussil>le string combinations. 

The aim of v^alidation (regression testing) of tlie locaii;^etl 
software is to prove thai the* localization lias not negatively 
affected the functionality and perlVn'majice (jf the insfni- 
meat. Additiojial attention musi In- t^ahl to I.Vl>i( al localiza- 
tion errors (overflows or garbage generation). 

Automated Local Language Validation 

Widi tlie help of Ihe lestin^^ tools developed \ov our t>atient 
[ I ] n n i t o r s ( s t M ' Fig . I iii id Ihe ae C( jin ) )ai lying ari i c I es i n I h \ s 
issue), local language validation is a fully automated proci^ss: 
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(a) Extract of a Test Procedure for Blood Pressyre as tt Is Used for Tests of English Software 

* alarm suspended ~> alarms not suspended 
merlin mainscrn 

mecif skey -kalarmvol 
merlin "SwitcbOnAlarms" 

* = S a i^ = = = = = = = = = = = = = = = = = ==^ = = = = = —= = -=-^-=Si== = =iZSS = Ti = =^^=i=.=-^=' = 

* Verify begin 

* INOF "NBP EQUIP MAIiF" $ 
Sound is hard inop ; 

^ Verify end 

* — — — - — --—= — ^— — = — = = — — = — — = = = :=: = = = = = = = — = = = ^s=-^^^SS:=^-==S = =^'= = 

* = = - = = = = = = = = =- = = = ==:=££;£ = = = = = = = = = = = = = = = = = = = = = = = = = = = == = = == = 

*■ Verify begin 

^ twprompt is "Problems with the pneumatic are detected" ? 

^ verify end 

* ==;':?:=.^- — = = z=:^ = == = = =:===== == = =. = = = = = = =— = = = = = == = = == = = = = = = = = = 

(b) The PrDcedtire after Translation of Spftkeys {Menu Selections) to Finnish 

* alarm suspended -> alarms not suspended 
merlin mainscrn 

mecif skey ^kalarmvol 
merlin "HalytPaalle^' 

* ____—.^-.^^^^^^^^^_.^j__^^____3__..____— _______________ _2___ 

^ Verify begin 

* INOP "NBP EQUIP HALF" ; 
'^ Sound is hard inop ; 

" Verify end 

* ====' = == = ss = iss;=i = = i=si = = = s: = = ss === = = = = = = === = = = = = = = = = = = == = = == = 

* Verify begin 

'^ twprompt is "Problems with the pneumatic are detected" t 
^ Verify end 

■* ====================================================== 

(c) Protoaol File after Test Run with Software in Finnish 

23:35:14 * alarm suspended -> alarms not suspended 

23:35:14 merlin mainscrn 

23 : 35 : 17 mecif skey -kalarmvol 

23:35:21 merlin "H^lytPfialle" 

23:36:11 * ==^=== = === = === =======^^=^ = ==^ = = = = = = = = = ^==^^=,=:======= = = 

23:36:11 ^ Verify begin 

23:36:11 ^ IKOP "NBP EQDIP MALF" ; 

23:36:11 * Sound is hard inop ; 

23:36:11 ^ Verify end 

23:36: 11 * ^^^= = = = = = = =^ ===:^^^=:^=^^^=^^^== = === = = = = ========= = = = = = = = = 

Filter: NBP -NU, @ 

23:36:13 (hard inop sound AHec; CS) 

23:36:13 I "NBP LAITEVIRHE "0-1 NBP -NU p= 0--H 

23:36:23 * ==^^^ = === = = = = = = ================ ================= = ==^=^ = 

23;36;23 ^ Verify begin 

23i36;23 * twprompt is "Problems with the pneumatic are detected" ; 

23:36:23 " Verify end 

23:36:23 * ====== = = === = = = = === = ============== = ^^^=^========= = ==== = = 

Tuned : TWPROMPT, 

23:36:26 F " NBP " B, 15 

23:36:26 P "NBP last calibration done 16 KES 94 15:32 " W 3 T 9 

23:36:29 F "Pneumatiikassa on havaittu cngelmia " W 3 T 

23:36:31 EOT 



Press a hardkey 

Make a selection from a menu 

Verify statement {AutoCheckf 



Verify statament (AutpChack) 



Press a hardkey {no transtation) 
Make a selection from a menu 

(translated] 
Verify statement (AutoCheckl 



Verity statement (AutoCheck) 



String translated 



Combined string onlv partfy 

translated 
String translated 



Fig 2. Example of the steps of the lotial language test proems. 
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1. Translation t^lps called ktati tangmige iabif's are prepared 

from iho native language support database ('{intaining all the 
Eiiglisli strings ajid their corresponduig traiislatioivs. 

2. A test package, a subset of the lest prfX-echires designed 
for the regression testing tjf the origtmil English version, is 
compilefi, 

3. A capy of eacli rest pnx-^lure out of tJiis pa(^kage is trans- 

laitHl into tlu^ im-aJ liyiguage by a tool developed by software 
qnaJilTk- engineering c^lic*<l skey. Tlie intention here is to re- 
t)Iaee in the test prorerlures the st-lection menu items fsoft- 
ke\'«) with the con-esponding kn ahzed terms. Of eom-se. a 
test can be executed by passing ilie position of a selection 
item (e.g.. "prt*ss the second seletiion on the third menu"") 
to the test execution ttjol but this approach has provc*d to 
be inelTectivp, The issue here is not lo test that tiie "s**contl 
selection of the tliird menu" work-s, J)e(^ause litis was already 
rested in Etighsh, but to prove that the "sc^eond selection oi 
the third menu** is translated eoirectjy. ;ind if selected, pro- 
duces exactly tlie same beha\1or as its Eugiisii countt^rpart. 
By calling the selections by their values and noi by tlieh 
positiotis we achieve higher test coverage and we test func- 
tifmaJity and tmtislation at the same time. Another argmuent 
for tJiLs apijroach is that the "second selecti<m oti the third 
menu " may i>e configuration det>eiMlenT [even local <"oiirign' 
ration dependt^nt) and therefore not accessibh^ by a ijosilion 
depeufient test (e.g., it is still on the third menu hut hi dte 
sixth place). Tluis, calls by value make test procedures more 
1 ijbusl . 

4. Tlie translated test procedin-e is passed to Aiif r/lest ^ and 
is run ou tlie localized so ft w- are. The results are savetl hi 
ju'otocol files (ontainitig English verify statements flhe 
expected results). localized softkeys (selections), atid local- 
ized actual lesidts ^see I he examj)le in Fig. 2). 



5. The protocol 61 es are submitted to AutoCheck {see 
article, page IfJ^l). First, tlie AuloCheck preprfK*ess*jr takes 
over the task of traiLslatiiig tlie verify staienu^nts. It uses the 
local language tables to replace the English text in the verify 
clauses with the localized text. On the second pass these 
translated ex(3ecte<i results iire conij>ared wltli tire localizeti 
actual resTilts. Biscri*|>ancies are reported in ilie normal way 
iHit with localized cotitetil, 

A special solution is also provided for Asiaxi languages 
Isimplifted and tradiiioiud C'hli\ese and Japanese I. which 
use l(5-bit codes. For tiiese languages the hexadecimal 
fc^iuivalent for each character is used in the test |)rocedures 
instead of the "tlrawir cliaracten Tliis enal>les us to keett 
sut*h charactei-s in ASCII files (like tlie test jirocedures atul 
die protocol files) and use tliem with the test execution and 
evaluation tools. 

Tlie automateii loc^al language Viilidation has cLramaticaily 
unproved the process iif UK^alized stiftware release. It has 
reduced the effcnt for local language testing for a new 
patient monitor release from twelve to four weeks and has 
significantly increased tlie test coverage compared with tire 
traditional niEuiual testing approach. 

A ek n o w I edgmen t s 

The auiiiors wish lo thank Cjcrhard Tivig for hii^ support 

dining tire development <:jf the local language test tools. 
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6 H P D e sk J et B20C Pri nter 

David J, Shelley 

^^^^^■M| Dave Shelley is the MD 
^^^^^H^^H program manager for new 
^^^^^^^^^1 pr] nter pro d uc ! s at HP s 
^^f^j^ f^^^H Va^icouuer DivisEon and 
^B ^ ^1 managed the developmem 
^B^ J^ fl of the HP DeskJet B2QC 
^H ^1 primer, Previously he worked 

■P?^|ft|MP ^ 33 a quality ergineermg 

^^^^^^" s ect i on man ag er, and ea rl i er 
as the R&D secttoo manager for the Qu<et Jet family 
of printers, Dave received a BSEE degree from the 
University of Washington in 1973. Aiier graduating 
he joined HP's San Diego Printer Division and did 
electrical engineering and design for several printers 
and plotters. He then transferred to HP's CofvalNs 
site, worked on calcuJator peripherals, and was the 
RSD project manager for the HP ThinUet printer. 
Bofn in Seattle, Washington, Dave is married and 
has three sons He is a ham radio operator and works 
with the local county's emergency services. His other 
interests include outdoor activities such as cycling, 
racquethall, camping, and boaiing, 

James Ma jewski 

An R&D project manager 
at HP's Vancouver Division, 
James Mgjewski was 
responsible for the analog 
ASICs and the eiectionic and 
'Tiechanic^l product design 
r ^ _ Por the HP DeskJet B20C 
j^^^^t.^*-- : printer.Previouslyheworked 
' -^^^^^^ an the electronic product 

design for the DeskJet B50 printer. He received a BSEE 
degree in 1 380 from the University of Illinois. After 
graduaiing he joined HP's Colorado Springs Division 
and worked on the HP 64QDQ family of emulators, 
providing customer support as a product marketing 
engineer. Later he did electronic hard ware and firn^ 
ware design for various HP BBxj^k emulators He 
earned an MBA degree from the University of Colo- 
rado at Colorado Springs in 19B8. Bofo in Chrcago, 
lllrnois, James ps married and has two children He 
enjoys spending his free time playing sports, espe- 
cially golf, and supporting children's sports activities. 

Mark R. Thackray 

Mark Thackray is an R&D 
project manager St HP s 
Vancouver Diviston and is 
currently located in Barce- 
■■^' w I on 3 , Spsi n, whe re f)e is 
^^ r managing an BSD team 

j^^r ' designing a new DesignJet 

J^^ I arge f o rniat pr i nte r, Fo r tfi e 
^"^ I a St f ittee n y eac s h e ha s 
worsted as an engineer and ASIC designer developmg 
several generations of printers leading up to the HP 
DeskJet B2DC, including the HP DeskWriter 55D, HP 
DeskJet 56D. and HP DeskJet 64D, He is named as 
an inventor in a patent involving automatic clock- rate 






swaching for micraprocessor controllers. Mark re- 
ceived a BSEE degree in 1981 and an MSEE degree 
in 1987, both from Washington State University, 

JohnLMcWilliams 

Author's biography appears elsewhere in this section. 
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David M.Hall 

A software engineer at HP's 
•.■--r:r]tjver Division since 
l;'JX David Hall worked on 
the software driver for the 
HP DeskJet 820C primer 
Born in Chi CO, California, he 
earned a BSEE degree in 
1991 from California State 
University at Chico. He is 

professionally interested in object-oriented design. 

Before joining HP he hefd a summer student position 

at the Chevron Oil Field Research Center, David is 

married and enjoys restoring IVtGs,. 

Lee W. Jacksort 

A software engineer at HP's 
Vancouver Printer Operation, 
Lee Jackson is responsible 

'^■'^ ^ ^° '' ^^^ ^^^^' ^^^^ nt f f i rm- 
ware for the next- genera I ion 
rnkjet printer He contributed 
to the design of the printer 
dnver for the HP DeskJet 
8Z0C. He received a BS 
degree in computer science from the University of 
Washington in 19B6. After graduating he joined HP's 
Logtc Systems Division and has held a variety of sofr- 
ware and firmware positions working on embedded 
development tools and Inkjet printers. Lee is married 
and has one child. H(s hobhtes include juggling and 
hiking, 

Katrma He lies 

Katrina Heiles is a software 
P engineer at HP's Vancouver 
Division. She worked on the 
swath manager for the HP 
DeskJet 82 DC printer, focus- 
ing on the PPA data format- 
ting and servant architecture. 
Previously she worked in 
computer integrated manu- 
factui ing on the automatfon of inventory storage and 
delivery fo? the DeskJet manufacturing process. She 
IS professianally interested in object-oriented design 
and IS a member of the HP e-maii mentor program. 
She received a BS degree in the political economy of 
nature [ resources in 1 987 at the University of Califor- 
nia, Berkeley. She then received a secondary math 
teaching certificate m 1989 and a BS degree m com- 
puter science in 1993. both from Portland State Uni- 
versity. Before joming HP she worked at United Tele- 
phone doing inventDry accounting and econometrJc 
forecasting. Bom in Pnnceton. New Jersey, she is 
married and enjoys outdoor sports such as windsurf- 
ing, skfing. running, and .soccer. 






Karen E. Van der Veer 

Karen Van der Veer has been 
an R&D software engineer 
-' HP s Vancouver Printer 

vision since 1992. initially 
■'.e. implemented the I/O 
iin-:hitecture tor the first 
! ii'SkJet printer with bidirec- 
■ 'lal communication, She 
went on to become the soft- 
ware designer for the PPA status and I/O stack for 
the DeskJet a20C and designed the new PPA 1/0 data 
link protocol She worked as the technical product 
team lead for the HP DeskJet B90 primer. She is 
currently working on daisy chain and I/O industry 
standards and is named as an inventor in a pending 
patent for the Vlink 1/0 protocol Karen earned a 
SS degree in computer science from the California 
Polytechnic State University m 1992 and is currently 
working on an MS degree m computer science at the 
California State University at Chico. 

Thomas 1 Halpertnv 

A software development 
engineer at HP's Vancouver 
Pnnter Division, Tom Hal- 
penny worked on the PCL 
emulator and DOS redirector 
for the HP DeskJet 82DC 
pnnter driver and is cun-ently 
doing firmware development 
for the next- generation ink- 
jet printer. Since joining HP in 1973 he has developed 
software and firmware for numerous inkje[ printers 
and plotters. He is professionally interested in video 
engineering and is named as an inventor in two pat- 
ents, one involving the sorting and reordering of vec- 
tors plotted in a vector pen plotter and the other con- 
cerning the monitormg and controlling of the quality 
of pen markings on plotting media. He earned a BS 
degree in engineering from Harvey Mudd College in 
1973 and an MSEE degree from Stanford University 
in 1975. Tarn is married and has two daughters. His 
hobbies include taking care of more than twenty acres 
of forest and meadowland on his property. 
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ErikKilk 




Pi 




A development engineer at 
HP's Vancouver Divtsion, Enk 
Kilk worked on the I/O and 
general architecture for the 

■ i'lP DeskJet eZOC printer and 

■ne firmware technical 
lead for the next -gene rat ion 
printer Since joining HP in 
I9B6 he has worked as a 
firmware engineer on the HP 70000 Series spectrum 
analyzers and on several earlier DeskJet models. 
Hb is a member of the American Association for the 
Advancement of Science and is named as an inventor 
in three patents involving I/O protocols and user 
interface improvemeni on instrumentation, Erik 
received a BS degree in phystcs from the University 
of California at SerkeEey in 1984 and an MS degree in 
computer science from the California State University 
at Chico in 1 990. Born in Baltimore, Maryland. Erik 
enjoys dancing (mcluding swmg. country, and ball* 
room), skiing, and iravallng. 
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John L McWtMiams 

John McWilltams has beer 
i'^ ^^ 7*1^81- 3T HP's Vart- 
I vtsionsm^ 1993 

d-ij 'cCEmty wGfk€d m the 
ASIC design for The HP 
DeskJet aSJC prtrriei He i£ 
a member of the IEEE and 
has auttiDred a paper on 
integfalKJ circuit design 
for low-cosL high'vofume consumer prod jcis. He 
rsEeived a BSEE degree m 1991 from Lafavette Col- 
lege and an MSEE degree in 19^3 from the UniversiTy 
of CaEffornia at Berkeley Born in Grove City, Penn- 
sylvania, he enjoys sports such as NayaMng. wmd- 
surfing, snnwb&ardmg. and backpacking. 

Leann M. Macft/lillan 

Uann MecMiElian is a hard- 

..:jre design engineer at 
• - £ Vancouver Division and 
;. irked on IC design for the 
--^ DeskJet 8Z0C printer and 
i.-evLOusEy Jor the HP Laser- 
J Lit 3 and 4V printers. She 
earned a 8SE£ degree in 
19S9 from the University of 
Idaho and an MS degree in eiectricai and computer 
engineerrng in 1392 from Carnegie Mellon University 
She joined HP m 1989. leann was born rn Scotts Bluff, 
l^ehraska, is marraed, and has ana infant daughter in 
her free time, she enjoys skiing, fitness, and spending 
time with her family 

Bimal Pathak 

Bjmal Pathak has been a 

hardware design engineer 
at HP's Vancouver Division 
since 1992. He warlcedon 
ASIC development for the 
■ T' DeskJet 540 pnnter and 
T.-CGntly on the HP DeskJei 
820C, where he was respon- 
sible for the I/O DMA, DRAM 
control J er, and I/O circuits. He started his career at 
Texas Instruments in 1 984. workmg on custom iC 
designs far fast SRAM memones and for high-speed, 
floating-point digital signal processors and coproces- 
sors Bimal earned a BSEE degree in 1984 from 
Cornell University and an MSEE m 1987 from Rice 
University, In 1992 he received a Master of Manage- 
ment of Tachnoiogy from Portland State University 

Harlan A. Talley 

Harlan Talley is an engineer/ 
scientist at HP's Vancouver 
Division and was the ASIC 
development lead for the 
HP DeskJet B20C printer 
Previously he worked as tha 
electrical engineering lead 
on the HP DeskJet BSD 
printer Harlan received an 

MSEE degree in 197D and an MSCS degree in 1989. 

In hks free time. Harlan enjoys traveling, photography, 

and any iron men la I advocacy. 
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HustOB W. Rice 






An R&D engineer St HP's 
Vancouver Drvision. Htjgh 
Rice worfced on the analog 
system design, carfiage 

I— -n^H toand design, and head drive 
<v^ ^H ASIC design for tne HP 
I 1^1 i^eskJeta20C printer He is 

J ^pr jl^H professional I V interested m 
anaiog a nd d ig ital system 
DEsign tor printers and in teaching the fundamentals 
of electronics and pnnter operation to new employees. 
He is named as an inventor in three pendfng patents 
on switching povver supply design and pen control 
circuits He received a BSEE degree m 1984 from 
Rensselaer l^olytechnic Institute and an MSEE degree 
in 1992 from Sianforrf University. Since joining HP's 
Santa Clara Division |SCD| in 1984, his projects have 
included acting as the production engineering manager 
for the frequency counter product line at SCD and 
working as an R SO engineer on digital communica- 
tion ICs at HP's Comnrtuni cations Components Division. 
Born in Cupertino, California, Hugh is married and has 
four sons, He is active in his church and in his free 
time enjoys shotgun trap shooting, soccer, ultimate 
frisbee, and automotive repair and maintenance 
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Terry W. BlancNard 

'I An R&D section manager 
_ at H P 's S y St em s Tec h no lo gy 
!/isian, Terry BEanehard 
iiiiinages the team responsi- 
i lii' for providing CAO tool 
'•iutions for HP CPU designs. 
■ ludingPA^RISCandtheHP 
,ji:d Intel joint architecture. 
Recently he was responsible 
for Uii; rA ;^,J... ..ore microarchitecture design, 
architectural verification and behsvroral modeling of 
the CPU core, and FET level simulation to analyse 
electrical properties Earlier m the project, he also 
managed the memory and l/Q controller venfication 
and the cache datapath physical design. He is profes- 
sionatly interested in computer architecture, functronal 
verification, CAD tools, and external partnerships and 
is a member of the IEEE. He earned a BS ECEN degree 
in 1969 from Bngham Young University. He joined HP 
that same year and some of his favorite projects 
smce then include working on the PCXS CPU and 
memory controller lor the Series 70D workstations, 
the PA 7100LC CPU. and briefly on the PA 800O CPU 
before moving to the PA 7300LC. He coauthored an 
article on the design methodologies used in the 
PA 71D0LC. Terfy is married and has four daughters. 
He enjoys spending time with his family. He is also 
interested in music (vocal piano, and composition), 
landscaping, woodworking, hiking, and fishing. 
His civic activities include beir^g an assistant scout 
master, an HP e-mail mentor, a science fair judge for 
a local school district, and an autive member of his 
church. 







PaulG.Tobin 

H^gpi^H^s An ^M^ project mana^ 
I ^F^ ^m -■ the En^irsefJT^ Syjt^Tts 
I ^L^ _M aboratory at HFs SystMHS 

f j^^il^^^ achnofogy DtVESion. Paul 

'Mn 5S currently managing 
CAI] software dfi\*slopnmrl 
for highpeffarmance micro- 
orocessor design He re- 
cently worked on the HP PA 
13WLC microprocessor contrtbutrng to the ffosting- 
pomt control design, dei)ug enhancements architec- 
ture and implementation, and functional verificaiTons. 
PreviDusiv he worked as a design engjneer on the HP 
PA 7100LC microprocessor, doing functional verifica- 
tion of the CPU core and memory and f/0 controller 
He is named as an inventor in five pending patents 
on debug hardware for the PA 730DLC and has coau- 
thored an article explaining how the PA 730DLC pra- 
cessor integrates cache while considering cost and 
performance factors. He is professjonally interested 
m high-speed simulation, formal verification, and 
CAD frameworks and is a member of the IEEE. He 
received a BSEE degree in 1987 from the University 
of Notre Dame and an MSEE degree m 1988 from 
the University of Illinois, Before joining HP in 19BB, 
he worked at the David Sarnoff Research Center in 
Princeton, New Jersey, domg software prototyping of 
wide area network designs. Paul is married and has 
two children. He enjoys photography, snow skiing, 
water skiing, and playing with his kids- He also plays 
trumpet in a local concert band. 
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Leith Johnson 

I' ^^^^^^ ^ Leith Johnson is a member 
t ^^V^\^^ [) f the technica i sia ff a t H P's 
' ^« ^-^ ''^* S ystems Techno logy D ivi s io n 
and IS currently working on 
the next-generation mid- and 
high-end core electronics 
complexes. He recently 
worked on the memory and 
second 'level cache central ler 
design for the PA 73D0LC CPU. Since joining HP In 
197B, his memorable contributions include working 
on the bit-slice processor for the HP 9845 desktop 
computer, the processor board design for the HP 3000 
Model 825 and HP 9000 Model 925 computer systems, 
the memory controller design for the onginal HP 9000 
Series 700, and the memory and I/O controller design 
for the PA 7100LC. He is professionally interested in 
computer system design with an emphasis on memory 
controllers and is named as an inventor in five patents 
related to high-speed computer bus design and in 
two patents pending for his work on the PA 7300LC 
Born in Seattle, Washington, Leith was awarded a 
BSEE degree in 1978 from the University of Nevada 
at Renn and an MSCS in 1988 from Colorado State 
University. In his free time, he enjoys outdoor sports 
such as back country skiing and bicycling and also 
likes to play pinball, 
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Stephen R Undv 

An engineer/sclentfst in the 
engineering systems tabora- 
tory at HP's Systems Tech- 
nology Division, Steve Undy 
fecentty worked as the 
rnicfoarchitett anilie daia 
C3cliefortheHPPA7300LC 
processor and is currently 
responsible for the instruc- 
tion cache or a future microprocessor. He is profes- 
sionally inter esied in computer architecture, ceches, 
and microp recess or verilicetion and coauthored sev- 
eral articles on these subjects fortt^e HP Journal, 
CompCon confeffinces. and the JEEE, of which he is 
a member Steve received a BSE d eg nee in electrical 
engineering and a BSE degree in computer engineer- 
ing, both from the University of Michigan in 19B3. He 
then earned an MSEE degree in 1385 from Purdue 
University. He joined HP's Inforination Hardware 
Operation that same year and his two favorrte projects 
since then include designing the instruction and data 
cache for The PA 71 DDLC and working on the HP 9D00 
Series 7Q0 computer, designing the CPU cache control 
and verifying the custom mefnory and I/O controller 
chip. Steve was born in Detroit, Michigan and is 
married. He enjoys outdoor activities such as sailing, 
scuba divmg, cycling, astronomy, and woodworking. 
He also enjoys theater and reading. 
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Davtd C. Kubicek 

David Ku bice if has been 
^^ __ ^^^— ^ member of the technical 
\^K ' ^^^B ^^^^^ ^^ ^^^ engineering sys- 
^^C . ^B tems laboratory at HP's Sys- 

tems Technology Division 
since 1994. He recently 
worked as a designer on the 
HP PA 7300LC processor, 
where he was responsible 
for the mmp^]sitiGn of the instruction cache controller 
and the system -to^ester correlation for the processor 
Professionally interested in high-speed circuits and 
destgn-for-test, David received a BSEE degree with an 
emphasis in VLSI circuits from Iowa State University 
in 1994. He was born in Waterloo. Iowa, is married, 
and has a one-year-old daughter He enjoys being a 
father and participating in outdoor sports such as 
mountain biking and skiing. He also plays mustc and 
before attending college he played guitar and sang in 
a heavy metal band. 

Thomas J. Sullivan 

A. member of the technical 
^taff at HFs Systems Tech- 
nology Division since 1994. 
Tom Sullivan recently 
worked on the control physi- 
cal design, library develop- 
ment; and synthesis support 
foriheHPPA73GOLC.Heis 
currently pan of the floaung^ 
point design team for future microprocessor products. 
He is professionally interested m VLSI and DSP appli- 
cations and is a member at the IEEE. Tom received a 
BSEE and BSCS in 1983. an MSEE degree in 1984, 
and a D-Sc. degree in electrical engineering in 1993, 






all from Washington University in St. Louis As a 
research associate at Washmgion University, he 
worked on custom VLSI devefopment for DSP applica- 
tions, researched SIMD and MIMD processor arrays, 
and taught several classes. He also worked as a VLSI 
designer at McDonnel- Doug las before coming lo HR 
He is named as an inventor in three patents related 
to custom VLSI used in digital hearmg aid design. 
He coauthored several papers on VLSI as applied to 
hearing aids and on massively parallel SIMD archi- 
tectures. Tom is marned and has a son. Hts hobbies 
include softball, black and white photography, wood- 
working, and following baseball. 

Amitabh Mehra 

Amitabh Mehra has been 
a member of the technical 
staff in the engineering sys- 
tems laboratory at HP's Sys- 
tems Technology Division 
since 1994. He has worked 
on the global composition 
fof the PA 73D0LC micropro- 
cessor and IS currently doing 
this kind of woric for the PA B500 microprocessor 
Amitahh received a BSEE degree m 1993 from the 
California Institute of Technology and an MSEE 
degree in 1994 from Stanford University. Born in 
Green Bay, Wisconsin, one of his interests can he 
summed up in two words. *'Go Packers," 

John G, McBride 

A member of the technical 
staff in the engineenng sys- 
tems laboratory at HP's Sys- 
lems Technology Division 
John Mc Bride is currently 
writing netlist quality tools 
for custom VLSI designs. 
i^.-^ Recentlyheworkedonthe 
physical design of the in- 
struction cache for the HP PA 7300LC and is named 
as an inventoi' in four pending patents about his 
work Other memorable projects since joining HP in 
1988, include being a printed circuit board designer, 
product engineer, and test engineer for the HP 
C2Z04A disk drive, and later, a test engineer on the 
LaserJet 4ML and an ASIC designer on the LaserJet 
5P He has authored an article analyzing RAID perfor- 
mance and is named as an inventor m a patent on 
disk array implementation Born in Vernal. Utah, he 
received a BSEE degree m 19B7 from Brigham Young 
University and an MSEE degree in 1991 from Stanford 
University While in school he had summer intern- 
ships with IBM, the U.S. Natmnat Forest Service, and 
the Dakin5 Corporation. John is married and has one 
son, two daughters, and will soon have a newborn. 
His hobtiies include home improvenients, white 
water rafting, and tinkering with cars. He is also 
active in the Boy Scouts of America and is president 
of the Sunday school pfogram at his local church, 
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Duncan Weir 

Duncan Weir has been an 
engineer at HP's Systems 

Technology Division since 
198G, after graduating with 
a BSEE degree from Wash- 
ington University in St Louis. 
Since |ommg HP he has 
done design verification 
work on a variety of HP pro- 
cesic;:. riiiL) uv,\t\ LlHps, including the HP PA71Q0, PA 
710DLC, and PA 7300LC processors, He coauthored 
an article on the design methodology used for the PA 
7100LC. He is named as an coinventnr in a patent for 
the hardware TLB handler for the PA 7100 and in a 
pending patent for his recent work on the PA 7300LC 
random code generation, an area of professional 
interest lo him. Duncan was bom at Clark Air Base 
in the Philippines. He is married and liNes to run and 
enjoys tekmg his dogs out. He also enjoys playing 
indoor soccer 

PauJG.Toliin 

Author's biography appears elsewhere in this section. 
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Lin A. Nease 

■■^^^^^^H Lin IMease is a project man- 

^^^^^PI^^H ager for the system design 
^^f^ ^^1 team at HP s Enterprise Sys- 
^V , ^^1 terns Division and recently 
WB ^ f^l ^^5 ^ system architect on 
V «^^/ ■ ''^'^ D -class server He earned 
^ 1 : d BS degree in computer 

W6l ^ "^ science from Arizona State 

^^^ - ^ '- Universityin 1966. While 
in school he did datahase development for the Uni- 
versity's career center and for SCS /Compute, Inc., a 
ta)c processing service After graduating he joined the 
information technology group at HP's Entry Systems 
Operation. Memorable projects include doing firm- 
ware design and performance characterization for the 
HP 90DG G-, H-. and l-class seivers. early system defi- 
nition for the HP 900D K-class server, and firmware 
design for the HP 9DD0 Bx2-class servers. In 1994 Lin 
received an MBA from California State University at 
Sacramento Born in Hendersonville. North Carolina. 
Lin sen/ed as a Sergeant in the US Air Force for four 
years He is man'ied and has three children, 
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HA harcfware dss^ engines 
at HFs Enterprise Systems 
Ofviiion, KifkBresniic^ re- 
cent^ worked ofi the electn- 
C3i tngpn^fmg design, par- 
titlonrng 0f hsnjware. and 
processor motlyla design fuf 
the HP 9000 Senes 800 0- 
clBss server He ts currently 
tttieenhani^HPPABOQO 
U-claii jifiiCtiSSQ: upgrade- He is pfofessfonalily inier- 
ested in h^gh-speed digital design artd design tool 
environments He autttored an article in the HP Jour- 
nal in June 1984 on FA-RlSC symmeiric multipfocess* 
ing in midrange servers He joined HPs Systems 
Technology Divisien in 1989 after receiving a BSEE 
degree from Santa Clara University Some of tiis 
respansjbElitJes have included domg the processor 
board design for ttie HP 90D0 and HP 3DO0 G-, H-, and 
J-ctass servers and working on the initial electrical 
engineermg desjgn for the HP 9000 D-class processor 
and I/O hardware. Born in Hayward, California, Kirk is 
married and is a new father, In his free time. Kirk en- 
joys cooking, savoring micro- and home-brewed beer. 
and collecting and reading classic literature. 

Charles J. Zacky 

A software development 
engineer at HP's General 
Systems Division. Charles 
Zacky was the team leader 
for the development of the 
server firmware for the HP 
90D0 D-class server and is 
currently responsible for 
follow^on products. He 
joined HP in 1988 at tha Bdblingen Medical Products 
Division, where he worked on ultrasound obstetrical 
scanners. In 1990. he transferred to HP's General Sys- 
tems Djvision and since then has been dojng firmware 
development for the HP 9000 Series 600 products. 
Charles received a Master of Music degree in music 
composiTEon m 1983 and an MS degree in computer 
science m 1997, specielizing in software engmeermg 
methodotogy, both from the University of Montana. 
He worked as an instructor at the University of 
Montana in the music departmefU teaching electronic 
music and in the computer science department he 
developed a noninvasive, continuous-wave blood 
pressure monitor. In his free tfme. Charles pursues his 
interest m primitive living skills and has taken several 
trips, including one to the Utah desert, where he sup- 
plemented his diet with grasshoppers and another to 
Northem Alberta, where he slept m a snow cave in 
40-degree temperattires. 

Michael J. Greensjde 

Mike Greenside is a me- 
chanical engineer in the 
oroduct design group at HP's 
Enterprise Systems Division 
and vi/as the lead engineer 
for the HP 9000 D-class 
server. He is cun'ently re* 
r, . Y_ Sponsible for future high-end 
^ ' "' server product definition and 

design He received a BS degree in mechanical engi- 
neering and material science in 1981 from the Uni- 
varsity of California at Davis He has been at HP for 





fifteen years and seme of his favonte projects jrK:iyde 
d^i^ng the penp^al bay for the HP 9000 K-class 
sever, coct^giimg th@ n^^hami^l assembly for \ht 
HP Windows CliefTt PC. and desigiting the I/O ejcpan- 
SfOfi endosune for d^ \W OTO IS30 He js pr ofession- 
ally in^rested in design fm rnanufacturabihty Bom m 
Spokane. Washington. Mike is married and has tw*o 
sons Golf and voileytjall are his favorite sports 
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AJisa Sandoval is a mechani- 
1 cai engineer in the product 



"Tin 

^^■^Vh ^ design group at HFs Enter^ 
^^K ^JKM P^^se Systems Division She 
^^B^ "^ '^^B rec&ntly workedasatber- 
^^B WL rnai Cead engineer on the 
^^^ ^^^1 pr^^u^^ definition for the 
^^^^ ^^^H HP 9000 D-class server and 
^^^* ^^^* is responsible for future 
high -end server definition and design. She is profes- 
sionally interested in plastics design, thermal design 
and heat transfer, and project management In her 
thirteen years in design and manufacturing at HP. two 
of f>er favorite projects include codes igning and stan- 
ds rdiiing HPs El A rack family and codes igning the 
mechanical assembly for the HP Windows Client PC. 
She received a BS degree in mechanical engineennr; 
from the Universiiy of Peno at Nevada in 1982 After 
graduating she worked at Becton Dickinson for a year 
m R^D and plastics design. Alisa was born in Walnut 
Creek, California and has three children She is actively 
mvolved in the visiting scientist school program and 
in Little League baseball Her hobbies include motor- 
cycles, camping, baseball, and swimmmg. She also 
enjoys arts and crafts. 
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Scott P. Allan 

Scott Allan recently worked 
as a systems architect on 

the B-class workstations at 
HP's Workstation Systems 
Division. He also defined a 
clocking strategy and de- 
signed an ASiC to control 
the processor dependent 
hardware interface. He is 
currently responsible for the memor/ control archiiec- 
tore for the next-generabon workstations Scott re- 
ceived a BSEE degree, specialismg m computer sc^ 
ence. from the University of Colorado in 1962 The 
next year he joined HP's Desktop Computer Division 
and spent three years working in manufacturing on 
HP 9000 Series 200 desktop computer products He 
has spent the last eleven years doing primarily ASIC 
design and system architecture on HP's workstation 
products, including designing all or portions of, six 
ASICs ranging in functionality from the memory con- 
troller and ECC chip far the HP 90Q0 Series 300 and 
400 families of workstations, to a bus bridge for VME 
embedded workstations and a serial port megacell 
used in more than ten ASICs at HP Scott is currently 
working on an MSEE degree from Stanford University 
and hopes to graduate m June 1998. Scott is married 
and has two stepdaughters He has spent his life 
along the Colorado front range enjoying outdoor acti^ 
vities such as skiing, road and mountain biking, and 
triathlons. He also scuba dives anywhere it's warm. 




Bruce P Bergmann 

^^^^^ -^'"-r of tfie technical 

^BPI^^k ^s Workstati on 

^^^^H| r s Division, Bruce 

^^^B ^man recently designed 
^B -^eGSCPCl. and EISA Back* 
J^H plane arid the fast- wide 
^v^H^HaH SCSI option boards fonhe 
^^^V^V B-€laSS workstation H@ IS 
^^^ now the system architect 

and Sf^tem beard designer for a two^ay SM? (sym- 
metrical mo Iti processor) workstaiton He earned a 
BSEE degree in t975 from the Case institute of Tech- 
nology and an MS degree in electrical engineering 
and applied physics in 1976 from Case Western 
Reserve University After graduating he joined HP's 
Calculator Products Division He worked as an electri- 
cai design engineer for many of the HP 9000 Series 
3D0. 400. and 700 workstations. His favorite projects 
include, designing a graphics controller chip for the 
HP 9O00 Models 310 and 320 workstations, de-s igning 
a DMA control chrp for the Models 330. 350. and 
follow-ons. and designing the CPU board for the 
Model 382. Bruce is married and has two children. 
He has been a certified aerobics instnjctor for seven 
years and enjoys walleye fishing as a master angler, 
fine woodworking, and downhill skiing. 

Ronald P. Dean 

Ron Dean is a development 

• vineer at HP's Workstation 
^•, stems Divks ton and 
recently worked on the 
mechanical definition and 
^ ^ ^H .:i-^ign of the B-class work- 

^ '^'^^^F illations enclosure, main 
^^^ tray, and power supply. He 
is named as an inventor m 
three pending patents on the heatsink design, card 
guide, and alignment mechanism He is currently 
working on upgrades to the B Series and J Series 
workstations. He earned a BSME degree in 1977 and 
then joined HP's Calculator Products Division He 
worked on the HP 90DD Series 300 doing mechanical 
product design, including the case parts and cooling 
subsystems, and published an article about his work 
in the HP Journal He then went on to work on the HP 
'9000 Series 500 workstations and was responsible 
for the overall mterconnection strategy, case parts. 
cable, and boards He also contnbuted to the HP 9000 
Series 700 industrial computers, developing case parts, 
boards, interconnect, card cages, and backplanes. 
Born in Dearborn, Michigan, Ron is married and has 
four children He is a licensed engineer and in hts 
free time enjoys cross-countty skiing, woodvvorking, 
and bridge. 
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Dianne Jiang 

^^^^^^^^H Ar^ R8iD engineer at HP's 
^^^^^^^^^^M WDrkstaiign Sy interns Divi- 
^^HHrl^^H ;;iDn, Disnne Jiang worked 
^^K^ ^^^1 ^'^' ^^^ processor and system 
^■^ I verification for the HP 90DO 

^^L ^_,^ ^M B-cfa&s worltstaiiDn. Born in 
^^^L^ "^"^"^^1 Jilin, Cliir^a, Dianne received 
^^■^^^^1 a 6S degree in 1934 end an 
^^■^^^^^ MS degree in 1 987, faoth in 
solid state physics from Nanlcai University in Cliina. 
She went on to earn a MSEE degree in 1995 from 
Texas ASM UriEversity, where she worked as a re- 
search assistant in the electrical engifieeririg depart* 
menton a semiconductor laser signal processing sys- 
tem and in til e physics department doing research on 
electronic transport of high-temperature supercon- 
ducting materials. She pined HP in 1995 and one of 
her favorite projects since then includes designing an 
ASIC turn-on board for the HP B-class workstatfon. 
Dianne is married and has a son. In her free time she 
enjoys reading, music, walking, and famiSyacLivlties. 

D&Etnis L Fbyd 

A hardware design engineer 
at HP's Workstatfor) Systems 
Division, Dennis Floyd was 
recently responsible for the 
design of the system board 
■■■:■■ the 6-class workstation. 
liv IS currently responsible 
ior verifying a memary and 
I/O controller ASIC design. 
Dennis joined NP in 1 988 after earning an MSEE 
degree from the University of Minnesota. He spent 
the first five years at HP's Computer Manufacturing 
Division, introducing workstations and irtdu stria] con- 
trollefs into the production process. Ha then spent 
two years working on the test and verification of 
VME-based embedded controllers, Born in Danville, 
Kentucky, he received a BSEE degree from the 
.University of Kentucky in 1 986. Dennis is married and 
enjoys outdoor sports such as siciing, bicycling, and 
hikmg, 
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Evangelos f^Jko^aropoufos 

A software quality engmear 
.;* HP's Patient Monitoring 
Ziivision, Evangelos Mikoia- 
rnpoulos was the sofUvare 
quslirv lead for the latest 
patient monitor in the HP 
^^^^-.-^ ^^ OmniCare family. He IS pro- 
^Hfl^p: ' /^^M Sessional ly interested in 
^^■.^i >^^^^ quality systems, verification 
am:: va ■ Jai -.w. nnethods, and product generation pro- 
cesses. EvangeEQS joined HP in 1986 and initially 
worked on the design and implementation of order 
processing systems, then began doing spftv^fare qual- 
ity engineering for medical products. He is now re- 
sponsible for software quality assurance and pro- 
vides guidance for product development, He earned a 
master's degree in economics from the University of 
Athens in 1976 and a Diploma in computer science 
and operations research in 19S1 from the Universsty 
of FrihQurg in Svi/itzerland. After graduating he worked 
as a research assistant at the University doing statis- 
tical modeling, hfe then worked on logistic systems 
for the Greek Army, and later designed databases tor 
the National Hellenic Research foundation in Athens, 
Greece Evangelos was born in Athens. In his free 
time he likes to read and learn foreign languages. 
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Andreas Pirrung 

Andreas P?rrung is an R&D 
engineer at HP's Patiani 
Monitormg Division and is 
working on tlie design and 
implementation of the upper 
layer protocol software for 
LAN communication in an 
HP patient monitor. He is 
professionally interested in 
artificial inielltgence, machine learning, and software 
engineering. Andreas was horn in Neustadt,AA/ein 




strassB, Germany and received a Diploma in com- 
puter science from the University of Karlsruhe. He 
joined HP in 1993 as a software qualit/ engineer. In 
his free time. Andreas en|oys swimming, paragliding, 
and reading. 
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Jorg Schwering 

tA software quality engineer 
at HP^s Patjent Monitoring 
Division since 1991, Jorg 
Schwering develops and 
provides consultations on 
product generation guide- 
lines, He is professionally 
interested in product gen- 
eration processes and soft- 
ware testing techniques, He joined HP in 1 983 and 
worked half-time while attending college until he 
received a Diploma in computer science in 1991 from 
Berufsakademie in Stuttgart, Germany. Jdrg was born 
in Steinfurt, Germany, is married, and has an infant 
son. 
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Evangel OS Nikolaropoulos 

Author's Diography appears elsewhere in this section. 

Jorg Schwering 

Author's biography appears elsewhere in this section. 

Andreas Ffrrung 

Author's biography appe.ars eise.where in this sectian, 
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